Skip to main content

Full text of "Biometrika"

See other formats


Pee 

ese any Sepa Ne be MN Pee ane era 

t= een iiethe naan br Ot nde 8 al nfo tre A — 

Faia bales We Mediailacind nA nde Ast thats dood a inept toner AI 
Tn amaneet 


we Bhace ty et 
a peur Steal 


Paanrernenteren we Weyy sya 
Se 


a tlt dete ft 


i= Balin tee 


alban anh 
eer 


pt 
bia Rae 
Se A 
Ba 


if 
OR RRR A by 
i) San 


ag) by?) hs f 
i pales ¥ uno 
ile vat 


i 


Dats Nye 
NN ey 
Bin 


a) 
ya 


sO vib eK A 


A JOURNAL FOR THE STATISTICAL STUDY OF 
BIOLOGICAL PROBLEMS 


FOUNDED BY 
‘W. F. R. WELDON, FRANCIS GALTON anp KARL PEARSON 


KDITED BY 


KARL PEARSON 


VOLUME X 


APRIL 1914 TO May 1915 


CAMBRIDGE 
AT THE UNIVERSITY PRESS 


LONDON: FETTER LANE, E.C. 
(C. F. CLAY, Manacer) 
anD H. K. LEWIS, GOWER STREET 
WILLIAM WESLEY AND SON, 28, ESSEX STREET, STRAND 
EDINBURGH : 100, PRINCES STREET 
CHICAGO : UNIVERSITY OF CHICAGO PRESS 
BOMBAY, CALCUITA AND MADRAS: MACMILLAN AND CO., LIMITED 
TORONTO: J, M. DENT AND SONS, LIMITED 
TOKYO: THE MARUZEN-KABUSHIKI-KAISHA 


[All rights reserved] a, 32 


Zot 


i 
‘ = ‘aed 
Ape = = 
a é 
"s 
i ue) % 


t 


: Cambridge: 


PRINTED BY JOHN CLAY, M.A. 


* 


AT THE UNIVERSITY PRESS. 


= 
+ tee 
. 
P = 
. 
‘ 
: 
. 
. 
? 
ie mm 
; Rec a) 
ee oe 
ie at . 
a 
2 @—: = 


VIL 
VIII. 


ail 


XIII. 


CONTENTS OF VOL. X. 


Memoirs. 


Congenital Anomalies in a Native African Race. By HuGH STANNUS 
STANNUS 


Tables of Poisson’s Exponential Binomial Limit. . By H. E. Soper 
On the Poisson Law of Small Numbers. By Lucy Wairaker 
The Relationship between Weight of the Seed Planted and the 

Characteristics of the Plant Produced. By J. ARTHUR HARRIS 


On the Probability that two Independent Distributions of Frequency 
are really Samples of the same Population, with special Reference 
to Recent Work on the Identity of Trypanosome Strains. By 
KARL PEARSON 


On Homotyposis and Allied Characters in Eggs of the Common 
Tern. By WitittAm Rowan, K. M. Parker and JULIA BELL 
A Piebald Family. By E. A. CocKAYNE 


Clypeal Markings of Queens, Drones and Workers of Vespa ae Ws. 
By Oswatp H. Larrer : : : 

Table of the Gaussian “Tail” Functions; when the “Tail” is 
larger than the Body. By Atice LEE 

Contribution to a Statistical Study of the Cruciferae. Variation 
in the Flowers of Lepidiwm draba Linnaeus. By James J. 
SIMPSON 

Nochmals iiber “The Elimination of Spurious Correlation due to 
Position in Time or Space.” Von O. ANDERSON 


Statistical Notes on the Influence of Education in Egypt. By 
M. Hosny 


Height and Weight of School Children in Glasgow. By Ernen M. 
ELDERTON . 


PAGE 


269 


280 


288 


1V 


XIV. 


XV. 


XVI. 


Contents 


Numerical Illustrations of the Variate Difference Correlation 
Method. By Bearrice M. Cave and Kari PEARSON 


An Examination of some Recent Studies of the Inheritance Factor 
in Insanity. By Davip HERON 


On the Probable Error of the Bi-Serial Expression for the Corre- 
lation Coefficient. By H. E. Soper 


XVII. On the Partial Correlation Ratio. Part I, Theoretical. By L. 


ISSERLIS 


XVIII. Association of Finger-Prints. By H. WaItre 


XIX. 
XX. 


XXI. 


On the Problem of Sexing Osteometric Material. By KARL PEARSON 


Further Evidence of Natural Selection in Man. By Eruet M. 
ELDERTON and KARL PEARSON 


Frequency Distribution of the Values of the Correlation Coefficient 
in Samples from an indefinitely large Population. By R. A. 
FISHER : 


XXII. Appendix to Papers by “Student” and R. A. Fisher. HprIroRra 


XXIII. Tuberculosis and Segregation. By Atice LEE : : 
XXIV. The Influence of Isolation on the Diphtheria Attack- and Death- 


(1) 


(11) 
(iii) 


(iv) 


(Vv) 


(vi) 


(vil) - 


rates. By Eraet M. ELDERTON and KARL PEARSON 


Miscellanea. 


The Statistical Study of Dietaries, a Reply to Professor Karl Pearson. 
By D. Noet Paton 


The Statistical Study of Dietaries, a Rejoinder. By Kart PEARSON 


Note on the Essential Conditions that a Population breeding at 
random should be in a Stable State. By Kart PEARSON 
The Elimination of Spurious Correlation due to Position in Time or 
Space. By “Srupentr” 

On certain Errors with regard to Multiple Correlation occasionally 
made by those who have not adequately studied the Subject. 
By Kart PEARSON 


Formulae for the Determination of the Capacity of the Negro Skull 
from External Measurements. By L. IsseRLIs 


Note on a Negro Piebald. By C. D. Maynarp 


169 
72 


175 


179 


181 


188 
193 


(viii) 
(1x) 
(x) 
(x1) 
(x11) 


(xiii) 


(xiv) 
(xv) 


Contents 


Note on Infantile Mortality and Employment of Women. By ETHEL 
M. ELDERTON 


Announcement of Prize Essay by Professor F. M. URBAN 
Corrigendum, Congo Female Crania in Vol. vu. p. 307 


On Spurious Values of Intra-class Correlation Coefficients arising from 
Disorderly Differentiation within the Classes. By J. ARTHUR 
HARRIS 


Correction of a Teena made by Mr MAJOR GREENWOOD, 
Junior. By K. P. 


Note on Reproductive Selection. by Dain HERON 


On the Probable Error of a aot Coefficient. By Karn 
PEARSON 


On Medieval English Femora, A Reply to Professor ee By 
KARL PEARSON . ; ; 


Plates. 


Plate I. (1) Samuti, an Ateliotic Dwarf. (2) Subgiant, Height 
1:92 m., with Wife and Albinotic Child. (8)—(4) Etimu, 


aged 25, an Achondroplasic Dwarf . 
Plate II. (5)—(6) Masimosya, aged 19, Gynaecomastos, cae Shek 


features which were formerly described as those of Partial 
Hermaphroditism. (7) oe ee Idiot. (8) Case of 


Hydrocele testis 


Plate III. (9)—(11) Boy, Heol 15 years, showing Siac only 
Plate IV. (12)—(13) Son of Matikwiri, aged 7, a case of Scapho- 
cephaly. (14) Cases of Umbilical Hernia. (15) poet Hand 


and Foot in a Child, aged 5 


Plate V. (16) Case showing faint es nepresion of 1 upper 
lip. (17) Blantyre boy, aged 10, with Hare-lip. (18) Young 
Woman with two nipples on left breast. (19) Gobedi, aged 22, 


with congenital Humeral Micromely . 


Plate VI. (20) Ndala, Split Hand, left only. (21) Chibisa, aged 


30, elongation of all segments of middle finger and its meta- 
carpal bone. (22) Shortening of the fourth metatarsal bone. 
(23) Case of Se Oe Shortening of the left ce 
toe 


Plate VII. Transition of Poisson’s disor Limit of the Bi- 


nomial Series into the Gaussian 


. to face p. 24 


24 


24 


24 


24 


24 


vi Contents 


Plate VIII. Sample Eggs, Common Tern, Natural Size (In colours) to face p. 146 


Plate IX. Types of Mottling of Eggs of Common Tern . i . 146 
Plate X. Dr Maynard’s Piebald Negro. i : ; : a 193 
Plate XI. Pedigree of Cockayne’s Piebald Family . : >» 200 


Plate XII. | Photographs of members of Cockayne’s Piebald Family Fr , 
Plate XIII. .) ip ‘ 


Plate XIV. $ 3 5 9 % » » 
Plate XV. ” ” » »” ” ” 7) 
Plate XVI. " 3 3 5) 9 » » 
Plate XVII. Right forearm of a member of Cockayne’s Piebald 

Family to show leucotic patches. : » » 
Plate XVIII. © Leucotic forehead, blaze and white ca of ae 

in a member of Cockayne’s Piebald Family f i 4 


Plate XIX. Model of Skew Regression Surface giving mean Weight 
of Girls of Class B in Glasgow Schools for a given Height and 
Age. : ; ; : ; ee 49) 


Plate XX. Types of Finger-Prints : : > eee 


Corrigenda and Addenda. 


The following corrections and additions to the memoir on the Partial Correlation Ratio, 
pp. 391—411, have been received from Mr L. Isserlis : 

Page 396. Eqn (28) for = read “; Eqn (30) for 7yy= read ry,=; Eqn (35) for me 

Oz Oz 


a z 
oa 

read 4 
Oz 


Page 397. Eqn (87) for ry= read 1,=; Eqn (38) for cdz, read cqy2; Line following 
Eqn (45) for d=—Cyx, read d=—Cry; Eqn (46) for ys day read -ys'Gn2; Line following 
Eqn (46) for wf,? read ,,R, 

Page 402. Line 8 from top: after, 


c, \2 av 
melee =Isy {(a+24) (1- yn) a 


insert “provided the SDs of w for constant y are homoscedastic, or sufficiently so, for this 
to be an approximation.” 

Line 6 from bottom for y,p—736 read ys h— 306. 

Expression at foot of page, factor (7zyQx2—9.%y) of first term in large curled bracket 
should read (Tay Qa?y — Yay?) and factor (7x29x%—Ty2Fay) Of expression in numerator of last line 
of same bracket should read (12Qx2y —TyzJxy2)- 

Page 403. In denominator of second term of expression in first line for 1- yn? read 1—,n,?. 

Page 405. In denominator of left-hand side of Eqn (70) for g,22—7? read 9422-7? xy. 
me read 14 rey 


Page 406. Line below Eqn (71) and in Eqn (72) for - oe 
L474, QP vy 


Vole X. Parte“ | April, 1914 


BIOMETRIKA — 


A JOURNAL FOR THE STATISTICAL STUDY OF 
‘BIOLOGICAL PROBLEMS 


. FOUNDED BY 
W. F, R. WELDON, FRANCIS GALTON anp KARL PEARSON 


EDITED BY 
KARL PEARSON 


CAMBRIDGE UNIVERSITY PRESS 
C. F. CLAY, Manacer 
LONDON: FETTER LANE, B.C. : 
EDINBURGH: 100, PRINCES STREET 
also 
| H. K. LEWIS, 136, GoWER STREET, LONDON, W.C. 
WILLIAM WESLEY AND SON, 28, ESSEX STREET, LONDON, W.C. | 
CHICAGO: ‘UNIVERSITY OF CHICAGO PRESS 
BERLIN: A. ASHER AND CO. 
LEIPSIC: BROCKHAUS 
BOMBAY AND CALCUTTA! MACMILLAN AND CO/, LIMITED 
TORONTO: J. M. DENT’ AND SONS, LIMITED 
TOKYO: THE MARUZEN-KABUSHIKI-KAISHA 


Price Ten Shillings net. 


[Issued April 30, 1914] 


- Drapers’ ‘Cotapany. Research. Memoirs. % . 


Biometric Series, 


a ‘Mathematical Contributions ‘tothe 
_. Theory of Evolution.— XIII. On the Theory 


of Contingency and its Relation to Associa- |” 
tion and Normal Correlation. By Kart | 


Pearson, F.R.S. Issued. Price 4s. net. 


II. Mathematical Contributions to the 
' Theory of Evolution.— XIV. On the Theory 


of Skew Correlation and Non-linear Regres- 
sion. By Karn Pearson, F.R.S. Issued. 
. Price 5s. net. 

It. Mathematical Contributions to the 
matical Theory of Random Migration. By 
Kart Parson, F.R.S., with the assistance 
of JoHN BLAKEMAN, MSc. Issued. Price 
5s: net. 


Theory of Evolution.—XVI. 
Methods of ‘Measuring Correlation, By 
Kart Pearson, F.R.S. Jssued. Price 4s, net. 


V. Mathematical Contributions to the 


a oe of Evolution.—XVII. On Homo. 


Studies m ‘Néevonal De eornapn: ; 
I. On the Relation of Fertility in’ Man 


to Social Status, and on the changes in this 


Relation that have taken ~place in the last | 


50 years. By Davip Heron, M.A., D.Sc. 
Issued. Sold only with complete sets. 


II. A First Study of the Statistics of 
Pulmonary Tuberculosis (Inheritance). By | 


Kart Prarson, F.R.S.  Jssued.. 
III. A Second Study of the Statistics of 
: Pulmonary Tuberculosis. Marital Infec- 
tion. By Ernest G. Pops, revised by KARL 
Pearson, F.R.S. With an Appendix on 


-Assortative Mating by ErHe. M, ELDERTON. i 


Issued. 


V. On the Inheritance of the Diathesis 
; _-of Phthisis and Insanity. A Statistical | 


Study based upon the Family History of 


1,500 Criminals. By CHarLEs a 


MD, BSc. Issued. : 


Questions of He Day and of b, ine notes te net bench (vit ea 


‘I. The Influence of Parental Alcoholism 


on the Physique and Ability of the Off- | 


spring. A Reply to the Cambridge Econo- 
mists. By Karu Pearson, F.R.S. 
a Mental Defect, Mal-Nutrition, and 
the 'Teachei’s Appreciation of Intelligence. 
A Reply to Criticisms of the Memoir on 
‘The Influence of Defective Physique and 
* Unfayourable Home Environment on the 
Intelligence of School ean By Davin. 
Heron, D.Sc. 
III. An Attempt to sbaeebe some of the 
- Misstatements made by Sir Victor Hors- 
LEY, F.R.S., F.R.C.S., and Mary D. Stures, 
.M.D., in their Criticisms of the Galton. 
Laboratory Memoir: ‘A First Study of. 
the Influence of Parental Alcoholism,’ Y Be, 
By Karu Prarson, F.B.S. 


+. Theory of: Evolution—XV. On the Mathe- 


Mathematical Contributions to the 
On Further | 


VIL MendelismandtheProblemof Mental — 


‘ 


ty osis in nthe ‘Animal Kindo By dee 
ARREN, D.Sc., ALicE Lex, D.Sc., Eon 
- Lea-Smira, Manion RADFORD, and Kar 
_ Pearson, F.R.S, Shortly. awe 
VI. Albinism in Man. By Kari Pusat 
_ E, Nerrizsarp, and C.'H. Usuer. Text 
ran I, and Atlas, Part I. od ssued. Price © 
VII. Mathemntical Contributions tothe | 
Theory of Evolution XVIII. Ona Novel 
Method of Regarding the Association of — 
two. Variates classed solely in Alternative é 
Categories. By Karu PEARSON, F.RS. 
Issued. Price 4s. net. - 
VIIL. Albinism in Man. By Karn Pearson, __ 
; ’ E. Nerrursaip, and 0. H. Usamr. Text, gy sd Boney 
- Part II, and Atlas, Part IT. Issued. ‘Price 
303s. net. ‘ uz 
IX. Albinism in Man. ‘By Karu PEARSON, A st 
EK, Nerrimsarr, and C. H. Usoer. Text, — 


Part IV, and Part bg aoe ; Price. : 
21s. net. % " 


‘Price 3s. net each x excepted). 


VI. A Third ‘Study of the Statistics ° bot ih? 
_ Pulmonary Tuberculosis. The Mortality 
of the Tuberculous and Sanatorium Treat- 
ment. By W. P. ‘ELpsrron, ELA.’ ‘and 
AS Sus: Pree eT ‘Issued. gi 
Vil, be the AubeReaty of Natural Selection _ aae 
in Man. By E.C. Snow, D.Sc. Issued. Rsk! 
VIII. A “Fourth Study of the Statistics of eae 
Pulmonary Tuberculosis : the Mortality of 
the Tuberculous ; Sanatorium and Tuber- bg 
 culin Treatment. By W. PaLIn Ly gar Ceci ie 
F.LA., and. Sipney J. aes ALA, ate 
. Issued. | 
ix A Statistical. Study of Oral ‘Tem 
peratures in School Children with s on. 
reference to Parental, ‘Environmental 


‘MB. Juua Bett, MA, and. Kart 
PEARSON, F. BS: tones oe ‘ os 


ot 


IV. The Fight against Tuberculosisand _ a 
- the Death-rate bis i By Kart Ne 
- PEARSON, ' F. RS... 
V. Social Problems : Their Treatment: 
Past, Present and Future.” ele Kari igs 
__PEaRson, E.R.S.. pi os 
Vi Bugenics and Public Health. ere: 
to the York Congress of the Royal Sanitary 
Institute. By Kanu Pearson, F.R.S. — 


“Defect. I. A Criticism of Recent American ~ 
“Work, By Davin Heron, D.Sc. Issued — 
(Double. Number). Price Qs. NEL Hi 


vu ‘Mendelisinand the Problemof Mental 


Defect. II. The Continuity of Mental — 
Defect. a 


By Karu ae ERS and 
Gustav a Ts mae eae ys 


VoLuME X APRIL, 1914 No. 1 


BIOMETRIKA 


CONGENITAL ANOMALIES IN A NATIVE 
AFRICAN RACE . 


By HUGH STANNUS STANNUS, M.D. Lond, Medical Officer, Nyasaland. 


(1) I HAVE thought it would be of interest to put on record some observations 
made by myself in Nyasaland during the past seven years, on the subject which 
appears as the title of this paper. 


These observations relate to members of a native population of Bantu stock, 
belonging to several main tribes, namely, Mananja, Yao, Ngoni and Tumbuka, 
with a few references to the Nkonde in the north and the Nguru from the south- 
east. 


My interest in the subject was aroused by the frequency with which some 
abnormalities were seen and I think the facts I bring forward will go to shew that 
this unusual incidence is real and not only the result of the ease with which 
observations may be made among a partially clothed community. 


Statistics dealing with the subject, to be of value, must treat of large numbers, 
such have however only been possible in a few instances to be referred to later. 
I speak therefore largely from impressions in appraising the rarity or otherwise of 
any particular condition. It should be remembered in this direction that the 
cases now to be reported have been met with more or less casually, most of them 
while travelling on the path or in some village, few in the course of Native 
Hospital work and none in any Special Department. 

Classification is a matter of some difficulty for many reasons and as the number 
of anomalies to be described is not very large it is perhaps more convenient to 
consider the various conditions according to the anatomical part affected. 

One large section of congenital anomalies, Anomalies of Pigmentation, I have 
already dealt with (Biometrika, Vol. 1x. pp. 333—365), and they will not be 
touched on in the present paper. 


Biometrika x 1 


2 Congenital Anomalies in a Native African Race 


(2) Dealing with those deviations from the normal in’which there is a change 
of a more or less general nature, I refer firstly to Infantilism, at the same time 
recognising that such a condition may not constitute a truly congenital anomaly. 


To the class designated Idiopathetic Infantilism I should. relegate a woman 
aged 22 years seen in 1911 at Zomba who presented the figure and development 
of a girl of 13. There was no breast development, no pubic or axillary hair and 
the rounded contours of the body and limbs usually associated with this age in a 
woman were wanting; menstruation had not commenced. In other respects she 
appeared normal and her mental development was but little if at all below the 
average. 


(3) In W. Nyasa I encountered a very excellent example of the Ateliotic Dwarf, 
a perfect “little man,’ a man in miniature 1:25 metres in height. Another case 
which I think must be considered as one of simple dwarfism is here reproduced :— 
Samuti, aged 35, a Yao, 1:42 metres high. He is shewn together with a man of 
1:85 metres. Samuti shews no other abnormality (Plate IJ, (1)). 


No case of Cretinism or Myxoedematous Dwarfism has been seen. I may here 
mention that Cachetic Infantilism is well seen in some cases of spinal caries 
among Natives just as among HKuropeans. 


A paper on “Congenital Humeral Micromely” in the Nouvelle Iconographie 
de la Salpétriere, T. xxiv. pp. 463—471, Paris 1911, by Dr S. A. Kinnier Wilson 
and myself, contains references to two cases of Achondroplasia in Nyasaland. 
Since then I have heard of two other cases and seen a fifth :—Etimu, male, aged 
25 years, a Yao, son of Masinjiri of Ndindi’s near Chipoli, Dedza District. The 
subject stated that he had no children and that no member of the family was known 
to have been similarly affected. He is a perfect example of the condition as the 
photographs will attest, and further remarks are unnecessary (Plate I, (3) and (4)). 


The following measurements were made and tracings of his hands are here 


depicted (Fig. 1): 


(1) Head: maximum length. : : : ; .  ,20:1>em: 
(2) - breadth . : : : : 2 elas 
(3) circumference . 5 : : F : . 600 
(4) Nose: length, base to root . : ; : 3°6 
(5) breadth, across nostrils. : : : : 45 
(6) Face: bizygomatic breadth . : : Become 0) 
(7) length, nasion to chin 4 3 : 5 ae lala 
(8) F. to commissure of lips : ; 6'7 
(9) Standing height . : ‘ ; LS 2 
(10) Span of arms. ; : : . diss 


(11) Arm: acromion to external condyle of humerus. Bi a4) 


H. S. Srannus 3 
(12) Forearm: external humeral condyle to tip of ulnar 
tubercle. : : ; : : 3 : Se aliecm: 
(13) Forearm to tip of middle finger . ‘ : : oe Oe 
(14) Leg: top of iliac crest to head of fibula. ; 5 SS 
(15) 5 - - to external malleolus — . eS 
(16) . . z to sole of foot i : ~ ol 
(17) Trunk: upper border of sternum to umbilicus . . 3&4 
(18) ‘ symphysis pubis 45 
BY: 
Left. Fig. 1. Etimu. Right. 


(4) No case of actual Gigantism has been seen. Tallness or shortness often 
runs in families. The tallest man I have ever seen measured 1°92 metres. He 
was the father of an albinotic child and had internal strabismus but no signs of 
acromegaly (see Plate I, (2)). Another man who I have not seen but who was 
measured by Dr Davey at Kota Kota was 2:0 metres in height. No case of 
Acromegaly has been seen by myself. 


(5) The following case in the want of development of the lower jaw and 
zygomatic arches might be considered as the converse to acromegaly (Fig. 2). 
From the sketch the subject will at once be recognised as a type of Congenital 
Idiot, the above-mentioned features and ill-formed pinnae together with the rather 
bird-like appearance being characteristic. 
1—2 


4 Congenital Anomalies in a Native African Race 


Jaidi, male, aged 20 years, a Yao of Chumbosa, Bursali, is the second child of 
a family of three, the elder brother being dead and the younger sister normal. 
No family history was elicited. 


Fig. 2. Jaidi, 


The growth of the face is defective as before noted, the zygomatic arches are 
so little developed that there are practically no cheeks. The descending rami of 
the jaws converge very considerably so that the floor of the mouth is very narrow 
and the horizontal rami are so short that the symphysis is situated mid-way be- 
tween the lower lip and the neck as they lie on one horizontal plane. The palate 
is high and narrow. 


The following measurements were made: 


Maximum occipito frontal . : : ~ 19:tsem: 
e bi-parietal . : : ; : . 138 
Bizygomatic at junction of zygoma with temporal . > 123 
Nose: length. : : ; : : S47 
breadth . ; : : ; : ; : : 318 
Face: nasion to commissure of lips . : oleae 
» 5» symphysis of chin . : | lisa 


Right external strabismus is present and vision defective. 

Though mentally an imbecile with an impaired speech he is an excellent field 
labourer. He states that no woman would marry him but that he has had sexual 
intercourse and that he is capable of the act. 


H. S. Srannus 5 


A few other cases of Congenital Idiocy have been seen and include an example 
of Spastic Diplegia, a Mongol Idiot aged 4 years in W. Nyasa district and two 
microcephalic idiots met with in adjacent villages in Chikala district, in neither of 
which were factors of etiological interest elicited. 


(a) Aged 22, male, looked like a boy of 12 in physical development, the head 
was very stnall but no measurements were made; the palpebral fissures were 
markedly slanting downwards and inwards and an internal strabismus was present ; 
the ears and palate were normal; the hands large and like those of a man. 


(b) A male infant aged one year with so marked a degree of microcephaly as 
to approach in type anencephaly, the resemblance being the more marked as the 
protuberant eyes and lips were like those characteristically found in anencephalic 


monsters (Plate IT, (7)). 


(6) The following case is given at length (Plate II, (5) and (6)). 


Masimosya, aged 19 years (1911), a Yao of Chipi’s village Zomba, exhibits a 
marked want of development of sexual organs (male) associated with large breasts. 
The general form of the body is that of a woman; the attitude, voice, laugh, 
facial aspect and expression resemble those of a woman rather than of a man. 
The teeth are good, the body and limbs well developed and there is a fair deposit 
of subcutaneous fat. The breasts (see photo) are remarkable, being large, with 
large well-formed nipples and well-marked areolae, dark in colour. They have 
started to beome pendulous and resemble exactly those of a nulliparous woman 
of the same age. The abdomen is well formed and round the umbilicus there 
is a deposit of fat such as is commonly seen in women; the pelvis appears large. 
There is some hair in the axillae but none on the face or body. The pubes is 
rather prominent resembling the female mons veneris and there is some develop- 
ment of hair upon it. The penis is very small, only two inches in length and of 
infantile type, the glans is covered by a prepuce and there is no deformity. The 
scrotum is very small indeed and only contains one testicle, the left, which can be 
felt as a small body about the size of a bean, three-eighths of an inch long. The 
right testicle is not apparently present in the scrotum or inguinal canal. The 
scrotum shews no tendency to be divided nor is there anything in the arrangement 
of the skin to suggest labia. No rectal examination was made. . 


The subject is insane. He is fairly tractable and good-natured. He has 
delusions and hallucinations, it is reported, with various phases of the moon, when he 
is said to travel 15 miles to bathe in a certain stream, etc. He has tried to burn 
down some houses. I could get very little of his history. The mother and father 
are said to have been normal; the only other child, a girl, was insane and died in 
the Central Asylum. The subject once cohabited with a woman who was to have been 
his wife, but she ran away the next day and I was unable to find out from him if 
he had any sexual desire. Such is a case which would have been called one of 
Partial Hermaphroditism but in the absence of further data I shall not discuss it. 


6 Congenital Anomalies in a Native African Race 


(7) Obesity. No-cases of general obesity outside normal limits with possibly 
a congenital origin have been seen. Steatopygy does not occur. 

(8) Symmetrical Lipomatosis is conveniently considered here though perhaps 
not strictly within the subject. Three old women have been seen all presenting 
the same abnormal feature, namely, the presence of symmetrical lipomata in both 
axillae, each about the size of a small orange. In a fourth case the affection was 
one-sided, the subject giving a history of the gradual descent of the tumour from 
the upper aspect of the shoulder into the arm-pit. 

That these tumours were lipomata I can only support by clinical examination, 
they certainly were not of the nature of the pads seen in myxoedema and no 
signs of that disease were present. There is the possibility that they were acces- 
sory breasts but they did not present the characters found in undoubted cases of 
this condition. These tumours may have a similar pathogeny to the masses seen 
on either side of the back of the neck of men and specially described by 
Sir Jonathan Hutchinson; on account of their possible paleogenetic significance 
I have included notes on these cases here. 


(9) Lymphatism, Post-mortem examination on a boy 10 years of age who died 
after receiving a blow on the head revealed a thymus gland of considerable bulk, 
4 inches long. The blow had not severed the soft tissues over the skull and in 
the absence of any other evidence of injury or disease one might suspect the case 
to be one of lymphatism, an inherent disorder which had predisposed to death. 
In a second case, that of a woman aged 40 years who died after moderately severe 
burns, a body 44 inches long of yellow colour and firm consistency was found lying 
on the anterior surface of the heart, the apex of this body being at a level with 
the 2nd costal cartilage. 

(10) Coming now to Malformations, there is a well-defined deformation of the 
skull of which I have seen several examples, the main points of which are well 
shewn in the photographs. The extreme height of the cranium and marked 
dolicocephaly without bossing of the forehead, while the sides of the vault of 
the skull are flattened, are characteristic. The photographs depict a boy aged 7, 
son of Matikwiri, headman of Mlanje, whose two younger sisters are said to 
resemble him exactly in the deformity present (Plate IV, (12) and (13)). 

The second case is a boy aged 15 years, the head measured 21°5 cm. long and 
12°5 cm. broad (Plate III, (9)—(11)). 

(11) Congenital Ptosis is not uncommon and is associated with the typical 
expression due to this disability. A slight degree of Epicanthus may be fairly 
often observed; more marked, it is sometimes seen associated with obliquity 
of the palpebral fissures giving a regular mongolian character to the face 
(Fig. 3). 

Buphthalmos has been seen on two occasions in young adults with a history of 
its congenital nature but nothing else of note; tension normal and vision appa- 
rently good. 


H. S. Srannus T 


Microphthalmos was once seen associated with coloboma of the iris and choroid 
(see below). 


Coloboma. This defect was met with in two brothers aged about 18 and 17 
years, but neither parent nor, as far as 1 could ascertain, any other member of the 
family was similarly affected. Bwanali the elder presented a coloboma of the iris 
and choroid of the left eye; there was also a small opacity on the posterior surface 
of the lens, which however could not be traced more deeply but which suggested a 
remnant of an “arteria centralis.” . 


2 CLs 


Fig. 3. Epicanthus. 


The right cornea shewed some superficial opacities, the iris appeared normal, 
but examination of the fundus revealed a large white triangular area with the 
apex near the disc with here and there small masses of pigment. The middle 
portion of the white area was on a much deeper plane than the rest of the fundus, 
forming a posterior staphyloma, the whole composing a kind of posterior coloboma 


(Fig. 4). 


Right eye. Left eye. 
Fig. 4. Bwanali Coloboma. 


This boy also had an accessory nipple. 
The younger brother Pete presented on the right side a microphthalmic eye 
with coloboma of iris and choroid resembling the condition in his brother, with an 


8 Congenital Anomalies in a Native African Race 
opaque spot on the posterior surface of the lens. The eye is convergent and 
vision poor; he counts fingers at one yard. The left eye is normal. 


Dermoid Cysts of the Face have been seen in the situations shewn in the sketch 
(Fig. 5). One of these was excised and found to contain the usual pultaceous 


Fig. 5. Dermoid cysts of face. 


mass mixed with hairs. These hairs examined microscopically were found to be 
spindle shaped, tapering at each end, brown diffuse and granular pigment was 
present in them. 

A relic of the cleft between the median and upper external processes of the 
foetal face was on one occasion seen as a small pit at the lower extremity of and 
just external to an epicanthal fold. 


(12) Congenital Naevus. Only two cases of naevus have been seen. One a 
woman presented a small naevus just to the left of the middle line on the forehead 
at the margin of the hairy scalp, 1 cm. in diameter. The second was a man with a 
similar growth 1 cm. in diameter on the lower lip just to the right of the middle 
line (Ching’waya of Zomba). 


(13) Har. The general conformation of the ear varies a good deal; some of the 
types are shewn in the sketches (Fig. 6) but all these must be considered as coming 


J III 


Fig. 6. 
within the limits of normal variation. In one case a kind of Accessory Lobule was 
noted ; the subject was an albino. A number of persons with Accessory Auricles 


H. S. Srannus 9 


have been seen. These consist of little subcutaneous nodules of cartilage forming 
tubercles one to four in number situated just in front of the tragus, the affection 
being usually bilateral. 

An abnormality seen affecting a woman in N. Nyasa consisted in the direct 
prolongation of the skin from the side of the head on to the outer surface of the 
pinna so that the upper margin of the ear was hidden, though easily felt beneath 
the skin. 


Helical fistula. Under this name have been described the remains of the first 
branchial cleft found as little pits on the helix. The condition is certainly rare 
in England and persons exhibiting the anomaly are sometimes shewn as interesting 
cases at medical societies. That heredity plays a part in its incidence is well 
known as illustrated by a case shewn by Dr Prichard at the Royal Society of 
Medicine, an infant with symmetrical helical fistulae, whose mother, four siblings, 
maternal grandmother and two great-aunts all exhibited the same defect. 
Having noted this same anomaly in quite a number of natives I became interested 
to ascertain the actual incidence. The statistics given below embody the results 
of my observations covering nearly 6500 individuals of all tribes. The popula- 
tions of whole villages were taken so that consecutive unselected persons were dealt 
with, 


Tribe core Right Left pou 

. { Males 416 7 6 3 

No gis { Females | 612 13 Soni bas 
eS Males ... | 100 | — 4 1 
Females... 136 — 3 — 

Wionsa J Males... 1941 | 34 22 12 
6” | Females... 2576 | 69 53 23 
Wankonde* ... as 455 | 4 8 5 
Awemba a ace 48 1 — 1 
Anyanja ee oa 65! 1 1 -— 
Ahenga sb aah | 142 3 5 — 
Totals... .. | 6491 132 110 50 


Thus among 6491 individuals of all ages and both sexes a total of 292 were 
found to have helical fistula (4°5 °/,). It was more commonly unilateral, affecting 
the right side a little more often than the left, giving percentages of 2°08 and 1°69 
respectively and for bilateral cases 0°77°/,. Taking each sex we see that the 
proportions between the three numbers are almost the same. 


2457 males 3 41 By) 16 
3324 females... 82 64 28 


* These figures were kindly supplied by Dr Davey. 
Biometrika x 


10 Congenital Anomalies in a Native African Race 


The actual incidence in the two sexes is however greater among females than 
males in the proportion of 5:2°/, to 3°6°/,. An abnormality occurring so fre- 
quently as 45 per mille might almost be considered to be a variation within the 
limits of the normal. The fact remains, however, that it is the persistence of 
a foetal character and abnormal, if the whole of mankind be taken into con- 
sideration. 

Dealing more in detail with this defect, there is some variation in the exact 
site of the fistula; the sketches (Fig. 7) serve to illustrate the extremes of posi- 
tion in three directions. 


JIIVG 


Fig. 7. 

Three cases presented two pits on the same side, one each in positions A 
and B. In these three cases the affection was bilateral and symmetrical. The 
common position at which the pit is found is in D. In another case not included 
in the series a pit was observed resembling those above mentioned but situated at 
the junction of the tragus and lobule as in £. 

These helical fistulae, which I have described as little pits, consist of a small 
opening on the skin 1 or 2 mm. in diameter leading into a blind sac 1 or 2 mm. 
deep; often this sac opens out into a little ampulla which can be seen and felt 
under the skin. The ampulla and canal are generally filled with a little plug of 
sebaceous matter. 

In three cases the skin in this situation looked like scar tissue and presented 
a honey-combed appearance, there being several openings into the ampulla giving 
the impression that an abscess had formed at some past date in the ampulla with 
consequent loss of tissue. 

The fistula is so common and so unremarkable that most tribes have no name 
for it and one cannot elicit long pedigrees to shew its incidence in families. Cases 
of heredity were common enough but the type was not necessarily the same in 
members of the same family; thus a mother with Left Fistula had a child with 
Right and Left, or again, three brothers were seen two with the Left side affected, 
the third with Right Fistula. 

No malformations in connection with other branchial clefts have been seen, 


H. S. Srannus 11 


(14) Lips, Mouth and Palate. Most natives shew a well-marked tubercle in 
the median line on the “red” margin of the upper lip; in a few however this is 
replaced by a distinct groove which involves the red margin of the lip or only the 
subjacent fold of mucous membrane (see sketch Fig. 8 and photo, Plate V, (16)). 


Jae ‘ = 


Fig. 8. 


These cases resemble one of a Hindu (recorded in the Lancet, Oct. 2, 1909, by 
Thurston), who besides having the median hare-lip was the subject of poly- 
dactylism. In one of my cases there was a considerable gap between the upper 
central incisors but no further abnormalities were present. 


In a single case notching of the upper lip was found to the left of the middle 
line with a mark running up to the nostril which looked like a scar. There was 
no question of any operation having been performed, though the condition 
resembled exactly an artificial repair of a lateral hare-lip (Fig. 9). A similar 


Bigs 9: 


case has been shewn at the W. Lond. Med. Chir. Society in which there was, 
besides, a deformity of the nose and a family history of hare-lip. I have only seen 
one case of ordinary Hare-Lip, a Blantyre boy aged 10 years (1909), the affection 
being left-sided and unassociated with any cleft of the palate (Plate V, (17)). 
Among 30,000 natives examined in the northern districts of this country no 
case was seen. 


No case of typical Cleft Palate has come to my notice; on the other hand I 
have seen three cases which owing to their non-association with defects in the 
upper lip are of great interest. All three cases, one a boy aged 10 years (1906), 
the other two adult males, presented complete Absence of the Premasilla and 
attached teeth. In the boy there was also a Median Perforation in the hard 
Palate. Congenital perforations of the palate apart from clefts are apparently 
rare in Europe. Dundas Grant (Roy. Soc. Med. April 1910) has recorded the case 
of a girl aged 16 years with a perforation above and to the right of the base of 


2—2 


12 Congenital Anomalies in a Native African Race 


the uvula with no history of trauma or syphilis. Prof. Karl Pearson has drawn 
my attention to a skull which was brought by Du Chaillu from Fernand Vas in 


the Congo (see Biometrika, Vol. vit, Plate XXVI); this shews congenital . 


absence of the premaxilla, but the two maxillae have not approximated in the 
mid-line in front as in my own cases, and we do not know the condition of the 
soft parts, but it is interesting to see this anomaly from another part of Africa. 


(15) Teeth. Native children are said to be born sometimes with teeth; it is 
possible that this is not very rare as there is a common superstition regarding 
them. I have seen one case with this history, to be mentioned later, as having 
deformities of the lower extremities. A gap of as much as 4 of an inch between 
the lower central incisors has been noticed a number of times, the other teeth 
all being regular and touching one another. A similar condition may be seen also 
affecting the upper pair of incisors, one that I am not conversant with among 
Europeans. Among 1500 natives examined for statistical purposes in regard to 
caries the following numerical abnormalities were noted: 


(a) Complete reduplication of the set of teeth in an adult, the second set 
lying on the palatal side of what appeared to be the normal set. I have every 
reason to believe that this was a case of true reduplication, that is to say, the 
result of growth from doubled enamel organs and not of retention of the deciduous 
teeth. 


(b) Reduplication of upper incisors. 
(c) Reduplication of right lower bicuspid. 


(d) Reduplication of both bicuspids in the lower jaw on each side and in the 
upper jaw on the right side in a woman aged 24 years. 


A single case of a Bifid Eatrenuty to the Tongue was seen in an albino 
child. 


(16) Polymazia and Polythelia. 14 cases of these anomalies have been met 
with casually, so that I imagine this anomaly by excess is comparatively not 
uncommon. Short notes of these cases are given below for purposes of com- 
parison : 

(a) Male adult, accessory nipple springing from the skin at the right sternal 
edge opposite the 38rd intercostal space, it was large and well formed like a 
woman’s but there was nothing resembling an accessory mamma beneath it. 


(b) Male aged 45. Insane and suffering from spinal caries. There was a 
rudimentary accessory nipple in Scarpa’s triangle on the right side 14” below 
Poupart’s ligament. 


(c) Adult female, an accessory nipple on the right breast, small but well 
formed and lying above the one proper to the breast; both are patent and milk 
can be drawn through both. 


H. S. Srannus 13 


(d) Adult male, the accessory nipple is situated in a line with the left nipple 
below it and half-way between it and the costal margin. 


(e) and (f) Two women each had two nipples to the right breast. 


(g) A young woman was found to have two nipples on the left breast 
(Plate V, (18)). 


(h) Male with congenital coloboma iridis mentioned above has an accessory 
nipple just above and to the inner side of the right nipple. 


(«) Young male adult has just at the outer edge of the areola of the left 
breast a very small accessory nipple, and beyond this and above it over the third 
intercostal space another flat nipple with areola and hairs. 


(j) Male, presents a rudimentary nipple in the left groin just below the 
middle of Poupart’s ligament. 


(k) Young adult male shews a small accessory nipple just below and internal 
to the right nipple; his brother, father, and grandfather are all possessed of the 
same identical anomaly. The subject has no children, no nephews or nieces. 


(1) Female in hospital with syphilis has a small accessory nipple springing 
from the skin of the chest wall just internal to the point of the left pendant 
breast. 


(m) A woman with well-formed accessory breast in the right axilla. It is 
breast-shaped and pendant though there is no nipple. The woman volunteered 
the fact that it was a breast and said it swelled with pregnancy. The right breast 
was twice as big as the left. 


(x) An old woman with symmetrical masses in each axilla resembling rather 
the symmetrical lipomata mentioned elsewhere: see p. 6. She states that they 
appeared at puberty and thinks them to be breasts but denies that they enlarged 
with pregnancy. 

In the Japanese this condition has been shewn to be not unrare, and among 
them tuberculosis has been found to be more frequent than among the normal 
population. I can only support the idea with one case (No. b). 


(17) Meningocoele and Spina Bifida. No typical case has been noted. A man 
was seen with a little dipple of the skin over the lower part of the sacrum in the 
median line having a little fold of skin on either side forming two small vertical 
lips. 


(18) Penis, Testicle; Hernia. 
Epispadias, hypospadias and extroversion of the bladder have never been seen. 


T have seen a boy aged 18 years with a short penis enclosed in a fold of skin 
from the upper surface of the scrotum (Fig. 10).. The boy had other deformities 
which are described later. When examining a number of recruits I was surprised 
to find in a large proportion the right testicle hanging lower than the left, the 


14 Congenital Anomalies in a Native African Race 


reverse of what is known to occur among Europeans. On examination of 400 
consecutive men, adults, between the ages of 30 and 40 years, I found in 166 or 
41°5°/, the right testicle lower than the left. In the remainder or 58°5°/, the 
right testicle was on a level with the left, or rather higher in the scrotum. I also 
got the impression that, associated with right lower testicles, the testicles and 
penis were large. In another series of 280 men, the left was lower than or on the 


Fig. 10. Boy aged 18. 
same level as the right in 185; the right lower in 88. There were two cases of 


left cryptorchidism, one of right cryptorchidism; one each left and mght hydro- 
coeles and two right inguinal bubonocoeles. ites 


I have come across a number of cases of undescended testis among other 
natives, in some associated with a swelling in the inguinal canal, in others there 
was complete cryptorchidism. Inguinal hernia is not infrequent in adult males 
but I can give no figures relating to a large number of persons. In a single man 
it was associated with umbilical hernia. I have never seen a femoral hernia. 
Umbilical hernia is common enough especially in children. The following figures 
though small in number give some idea of the frequent incidence of the condition. 
They refer to all the children in a single village and may therefore be said to be 
unselected in any way. 


Age = P oe aE 

O— 1 year 18 13 6 

1— 2 years 44 27 12 
3—10_,, 102 44 10 

Totals 164 84 28 | =276, 

giving roughly COM. 0k Gis 

_—_—Yr— 
40 °/, 


That the hernia diminishes in size even to disappearance after childhood, as 
indicated by these figures, is certainly true as the same incidence is undoubtedly 
not found among adults. The protrusion is sometimes very marked and takes the 


————— eee ar wv ~ 


a 


H. S. Stannus 15 


shape of the finger of a glove, some several inches long and curving downwards 
(Plate IV, (14)). Writing recently E. M. Corner in doubting the commonness of 
congenital sacs in hernia in general, as insisted on by some writers, has shewn in a 
series extending to between two and three thousand observations that herniae in 
children are often multiple and associated particularly with a ventral hernia, a 
diastema, which though very rare at birth 1s common in young children and of the 
nature of a true hernia. He believes that this ventral protrusion, which is certainly 
not congenital, is caused by increased abdominal pressure due to gaseous distension 
of the bowels the result of fermentative processes, and that other herniae are due 
to the same cause. Among native children abdominal distension is almost the 
rule, “ pot-bellied” is an expression always used in speaking of them. This dis- 
ténsion is due largely, I believe, to fermentative processes, and also a second factor, 
absent in European children, namely enlarged spleen. Of 50 children under the 
age of 5 years taken from among those with umbilical hernia, 43 or 86 °/, were 
found to have the ventral protrusion as described by Corner. 18 of these had 
enlarged spleens and 20 shewed a considerable abdominal distension. In none 
was any other hernia found. In these cases we see par emcellence the effect of 
intra-abdominal pressure, in producing first ventral hernia and ‘secondly umbilical 
hernia. The weakness of the umbilical scar is due, I have little doubt, to the 
method of treating the cord at birth. The custom prevailing among many is to 
bind the whole cord and placenta on to the child’s abdomen till it separates ; 
with others the greater part of the cord is so treated after severing the placenta ; 
in any case there must be considerable tension, I think, at the umbilicus and 
sepsis is more likely to occur. Cursham Corner has said that the size of the 
bulging is proportional to the length of cord left proximal to the ligature, and 
the same principle adapted to natives who use no ligature may be true, and 
thus account for the very “long” umbilical hernias. 


I am therefore inclined to agree with Corner that the umbilical and the ventral 
herniae of children are due largely to intra-abdominal pressure, but though my 
numbers are small, the absence of any other hernia among my cases must be 
taken to mean that for their production there is another factor to be taken into 
account, and that is, I believe, in Corner’s cases some congenital structural 
anomaly, namely a congenital sac, and, conversely, I think congenital sacs are 
uncommon among natives of this country. 


(19) Malformations of the Extremities. Various forms of Congenital Talipes 
are met with which call for no special comment. 


A peculiar condition characterised by symmetrical shortening of the humeri 
has been observed and forms the subject of a paper by Dr 8. A. Kinnier Wilson 
and myself referred to above; certain deformities of the hands and feet are also 
therein dealt with. 


Since this paper was written I have seen three other cases of Congenital 
Humeral Micromely, one of which I mention here as there is a family history of 


16 Congenital Anomalies in a Native African Race 


the defect, a point of some interest and one which I had not elicited in previous 
cases. 


Gobedi, male, aged 22 years, a Yao employed as a machila carrier in Zomba, 
exhibits the deformity in typical form well represented in the photograph 
(Plate V, (19)). 


The head of each humerus appears to be poorly developed and though move- 
ment at the shoulder joint is free, a certain amount of fine crepitus is elicited, 
such as was found in several of the other cases. 


The point of interest however is the fact that the maternal aunt is stated to 
have had the same congenital anomaly. 


The subject has no brothers or sisters and his own two young children are 
stated to be normals, his mother and father and more remote relations are not 
known to be affected. 


Besides these the following cases deserve mention. 


A boy was seen, 18 years of age, with a peculiar deformation of the hands, 
stated to be congenital; the fingers and thumb shewed considerable thickening 
about the Ist interphalangeal joints with marked ulnar deflection; the bridge of 
the nose was depressed, the lips very thick, and epicanthus present. There was 
also the penile deformity above mentioned, except for which I should have 
doubted the statement in regard to the congenital nature of the hand deformity 
(Fig. 11). 


Fig. 11, Boy aged 18, 


H. S. StTannus 17 


I saw at Bandawe a female infant aged 14 years presenting multiple defor- 
mities. The astragalus of the left foot was apparently implanted in a cup- 
shaped depression on the lower end of a very much shortened thigh. The femur 
of this leg was short but around it there was an abnormal amount of muscle as 
if the usual amount of muscle for a normal had been cramped up into the 
shortened limb ; the foot could be freely moved by the child. The left foot had only 
a hallux and two toes with a partial cleft between the hallux and the adjacent 
toe, but I think four metatarsal bones. The right thigh was also somewhat 
shortened but the bones of the leg apparently both present, the knee-joint could 
not be distinctly made out and was flail. Right talipes equinovarus present, also 
right internal strabismus. No history of similar deformity in family, a brother a 
year older was born with two upper incisors. Father and mother normal. The 
father has two other wives with six and ten children respectively, all normal. 
Such gross congenital deformities are from time to time recorded in Europe, thus 
Lockart Mummery described a case of congenital absence of the femur in a male 
child, etc., in the Brit. Med. Jour. for November 5, 1910. 


In a male 35 years of age I found Congenital Absence of the Right Fibula, the 
tibia being bowed forward with 8 inches shortening of the limb, the foot on the 
same side had only three metatarsal bones and three digits including the hallux. 
A woman was seen with congenital shortening of one leg to the extent of four 
inches. 


A single case of unilateral Congenital Dislocation of the Hip has been met 
with. 


(20) Split Hand and Split Foot Deformities. The photograph, Plate IV, (15), 
serves to shew moderately well the deformities met with in a male child aged 
5 years (1905): in the absence of a skiagram it is impossible to go into the detail 
of the bony conditions present. There was no admitted history of similar or 
other deformity in the family. 


A second case, Ndala of Njalusi’s Mangoche, shewed a similar deformity of the 
left hand but in a less degree; he was otherwise normal and stated that no other 
members of his family were similarly affected (Plate VI, (20)). 


These cases are interesting to compare with those collected and classified by 
Lewis and Embleton in Biometrika, Vol. v1, 1908. 


(21) Shortening of the Fourth Metatarsal Bone. When first I entered the 
country my attention was attracted by a number of natives who presented a 
shortening of the fourth toe. 


Since then Captain Hughes has noticed the condition in Egypt. The descrip- 
tion he gives is as follows (Lancet, July 16, 1910):—“ The fourth toe is markedly 
retracted usually behind the level of the fifth toe. The phalanges are not appa- 
rently abnormally short, and the metatarsal bone can be felt unfractured but with 
the head very much farther back than usual. Commonly the digit is pushed 


Biometrika x 3 


18 Congenital Anomalies in a Native African Race 


upwards by the pressure inwards of the fifth toe. The condition is sometimes 
unilateral sometimes bilateral.” He adds that in one case the second metatarsal 
and, in another, the third metatarsal were also shortened. In a single case he 
saw a similar condition in the hand, shortening of the second and fifth meta- 
carpals. 

The above description corresponds exactly with the condition seen in this 
country. I have also seen other toes than the fourth affected, and I shew a photo- 


graph of a man’s feet with involvement of the metatarsal of the hallux; in- 


another case the fifth was affected; in another case, a woman, the common variety 
was associated with shortening of the third metatarsal of the left foot (see Plate VI, 
(22), (24) and Fig. 12). 


/ 


. / 
=. 


Fig. 12. Fig. 13. 


OM 2 


(22) Syndactyly of various degrees has been observed; sketches of two 
examples are given in Fig. 138. 

(23) Polydactyly is not at all uncommon. I have casually come across some 
dozen cases in five years. 


In the majority the supernumerary digit consists of a miniature phalanx 
attached to the skin of the hand or foot at the level of the head of the fifth meta- 
carpal or -tarsal bone. Such digits are often removed in childhood, leaving 
a small cartilaginous nodule at the seat of removal. Most commonly it is a 
symmetrical affection of both hands and feet; in other cases hands or feet alone 
(Plate VJ, (23)), or one extremity only, present the deformity. In some the 
accessory digit is well formed and an accessory metatarsal or metacarpal bone 
more or less complete is present. In one case it was the hallux which was 
reduplicated, the two digits being partially fused. In another, reported to me, 
the supernumerary digit in each hand was situated on the radial side of the 
first finger with probably an accessory metacarpal bone in connection with it. 
The feet bad extra digits beyond the fifth toes. 

(24) The following case is of some interest: 

Chibisa, male, an Angoni of Kawenga’s, aged 30 years. The deformities in- 
volve all segments of the right upper and lower limbs and to a minor extent the 


OO 


H. S. Srannus 19 


left limbs (Plate VI, (21) and Figs. 15—18). On the right side there is shortening 
of the humerus and forearm (10 cm. difference between the two sides), but 
elongation of all the segments of the middle finger and its metacarpal bone; the 
middle finger itself measures 10} cm. The metacarpal bones and phalanges of 
the other fingers are, I think, absolutely a little shortened. The left arm and 
hand are normal, except that this hand as also the right hand shew a little 
nodule at the base of the little finger where a supernumerary digit was removed, 


® 
Fig. 15. Fig. 16. 
O 
Qinee 
E™ 
Fig, 17. Fig. 18. 


The right foot presents a similar condition to the right hand, elongation of 
phalanges and metatarsal affecting the second toe, the toe itself being 7 em. 
long. The tibia is somewhat bowed outwards. The left foot presents shortening 
of the metatarsal bone of the hallux. . 

3—2 


20 Congenital Anomalies in a Native African Race 
The photograph and sketches illustrate some of these points. Other measure- 
ments were as follows: 
Height 166°5 cm.; span of arms 164°5 cm.; 
Maximum fronto-occipital 18°0; maximum biparietal 13'8; - 


Nose length and width 4:4. 


(25) Congenital Anomalies of the Kidney. Post-mortem examination on a 
native prisoner who died of pellagra revealed the presence of a double kidney 
on the left side and none on the right. 


From the sketches (Fig. 19) it will be seen that the upper part was the one 
proper to the side while the lower half was the abnormal portion. 


L.Suparenal-|-- 


Fig. 19. Churinigu. Kidney of Left side double. 


The two parts were really very distinct, partly separated by a groove and 
cleft. 

The lower viscus had been felt during life as a tumour in the abdomen of 
unknown nature as it lay along the left side of the vertebral column. The kidney 
was unfortunately removed before dissection of vessels, etc. was made, but the 
sketch shews the arrangement of these at the hilum of the kidney. . 


The two ureters united below the lower pole of the double organ, the distal 
ureter being nearly twice the normal size. 


Se a Se ee 


HLS. Sraynvus 21 


The bladder was normal; there was no right ureter. The suprarenal body 
of the right side was in its normal position and appeared normal. No other 
abnormalities were remarked. 


(26) Some suggestive observations have been made by Dr Ewald Stier, 
published in the Deutsche Zeitschrift fiir Nervenheilkunde (Band xiv, Heft 1-2, 
S. 21), from which the generalisation is made that in all anomalies of overgrowth the 
right side of the body is much more frequently involved than the left, whereas in 
anomalies of undergrowth the left is more commonly the site of the condition 
than the right, this distribution being the result of a preponderance of persons 
with a leading or superior left cerebral hemisphere, as with left-handed persons 
the converse was found to be true. In other words, the plus anomalies occur on 
the right side in right-handed people and the minus anomalies on the left side, 
the left hemisphere being the superior hemisphere, the converse being true. 


I have therefore tabulated my observations, and though small in number they 
tend to confirm the idea assuming that the African native is right-handed. This 
remains unproved and a less marked superiority of the left hemisphere may 
account for non-conformity of my few cases to Stier’s rule. 


| | 
Right | Bilateral) Left 
Plus Anomalies:  ° | 
Reduplication of teeth 2 3 0) 
Polymazia : : ae eel 1 0 
Polythelia ... soe She i0 | O 5 
| Polydactyly ... 1 1 2 
Minus Anomalies: 
Hare-lip ; | @ 2 
Cryptor chidism 2 0 2 
Absence of Fibula : 1 0 0 
~ and Tibia... 0 0) 1 
| Split "hand, foot 0 1 1 
| Shortened metacar pal, tarsal 0 1 3 
Syndactyly : 1 0 0 
Coloboma iris = 1 1 0 
Plus and Minus together : | 
Chibisa 1 (+) — | 1(-) 


In considering these cases it should be remembered that the majority of my 
observations have been made casually among natives met in the bush, in villages, 
etc., others in the course of routine work among troops, prisoners, etc., the few 
were the result of special investigation. 


(27) Concluding Remarks. The notes of cases which I have thus collected 
together form rather a medley of facts but I think certain deductions may be 
made from them. 


22 Congenital Anomalies in a Native African Race 


It would appear that 


(1) The slighter the anomaly the greater the frequency with which it may 
be observed. 


(2) The more marked degrees of deformity are only seen in children and 
those in places where European influence is felt. 


(3) Cases of heredity are only seen among the lesser anomalies. 


(4) The least obvious congenital anomaly is a helical fistula, and this is 
found in 46 °/, of the population and is frequently inherited. 


The difference in the observed incidence between the minor anomalies and 
those of more marked proportions may be real or only apparent. I think the 
latter supposition is true for reasons which can be deduced from the facts given 
above. 


It is the custom among all the tribes of this country to destroy all deformed 
children at birth. Any minimal deformity such as a helical fistula is of course 
unrecognised, an accessory nipple is probably hardly noticeable, accessory digits 
which can be removed by a nick with a knife are matters of no import, while a 
foot with six well-formed toes would hardly be considered worthy of note. These 
abnormalities are therefore comparatively common, but hare-lip, cleft-palate, 
deformities common enough in Europe, are among the rarest in this country; a 
child with a hare-lip would be seen to resemble a hare and would be immediately 
destroyed. Children with the greater deformities would certainly be destroyed. 
In recent years under European influence native customs fall into abeyance and so 
we see my single case of hare-lip in a boy aged 10 at Blantyre, a township of 25 
years standing, a child with gross deformities of the lower extremities born prac- 
tically on a mission station; or, to quote another example, an albino reported by 
myself was the fifth albino child born, the first four having been killed at birth 
by order of a chief, who in later years came under the influence of an up-country 
mission station, for which the living albino has to thank his survival. The gross 
abnormality of absence of premaxilla would pass unnoticed as the deformity is 
slight. History relates that in the case of the child with lobster claw deformity of 
hands and feet, it was only saved from a summary death by the efforts of the 
mother. 


I think with the evidence as it stands one may with fairness say that con- 
genital anomalies are common among the natives of this country. Secondly, I 
think one may also deduce from the facts stated that abnormalities of all kinds 
are at least not uncommon. In the few cases in which I have adduced statistics 
there can be no doubt, in other cases it is rather a matter of one’s impression. 


I have shewn that certain congenital anomalies among natives of Nyasaland 
are common and have attempted to argue that probably many of them are 
common. 


ae 


H. S. Srannus 23 


(28) Very few statistics are available for comparison, but I should hke to refer 
to some by writers in Egypt. Prof. Madden cites in a letter to the Lancet a case 
of cleft-palate which he operated on as the first in 11 years during surgical work 
at the Kasr-el-ainy Hospital, and assigns as the cause of the lack of such cases 
the “truly awful struggle for existence” which would eliminate infants so handi- 
capped. The Lancet remarked (Lancet, July 3, 1909), in an annotation upon this 
letter, that Prof. Elliot Smith considers it to be impossible to endeavour to explain 
this rarity of congenital defects in Egypt, unless the time-honoured scapegoat of 
our too modern civilisation be invoked to account for their frequency in other 
countries. Statistics of the Kasr-el-ainy Hospital compiled by Dr Day are quoted 
in 1907; among 2630 total surgical admissions the only congenital deformities 
were 5 hare-lips, 2 talipes, 2 imperforate anus, 1 extroversion of bladder; in 1908, 
2702 admissions, 3 hare-lips, 2 imperforate anus, 1 hypospadias, 1 undescended 
testicle, 1 meningo-encephalocoele. Capt. G. W. G. Hughes, R.A.M.C., in a paper 
to the Lancet, July 16, 1910, referring to this annotation, remarks “ Readers will 
be interested to hear that our too modern civilisation is innocent of this slur,” 
and goes on to shew that many congenital defects are by no means uncommon. 
Dealing with males between the ages of 14 and 21 years he gives the following 
figures : 

Hare-lip in 0°041 7%. 

Cleft-palate 0016. 

Polydactylism 0-058 '/ and 0:04 °%, in two series. 
Shortened metatarsal 0°37 7% and 0:23 % 

Other deformities of fingers and toes 0:22 7. 
Talipes 016 %. 

Among the thousands of ancient Egyptian bodies which Prof. G. Elliot Smith 
has unearthed and examined, a single case of cleft-palate was met with, a female 
of 20 years of age with a skull of negroid type, of between the 4th and 6th 
century B.c.; only one case of talipes (T. equinovarus) was recorded. 


It is obvious that in Egypt surgical treatment is not sought in cases of 
cleft-palate and rarely for other congenital defects but many of them are common 
enough. 


May the rarity of defects among the ancient peoples of Egypt be due to the 
same cause that acts in Nyasaland to-day? Were the children affected with 
deformities killed at birth and “thrown onto the dust-heap” where their remains 
were soon lost trace of? Of chief interest to me are the figures published by 
Captain Hughes. He shews that a shortening of the 4th metatarsal bone occurs 
in percentages rising to 0°37 of males examined. This defect is peculiarly common 
in this country. Again, polydactylism occurs in 0°05 °/, and other deformities of 
fingers and toes in 0:22 °/, of Egyptians, both deformities very frequently met 
with by myself in Nyasaland. 


24 Congenital Anomalies in a Native African Race 


He however does not mention polythelia and polymazia nor helical fistula. 
I should be very interested to learn if this last insignificant anomaly was looked 
for. There is no doubt that one of them, helical fistula, occurs with a frequency 
in Nyasaland which cannot be rivalled by any other among peoples of any race. 
I think one may also say with certainty that the incidence of others (shortened 
4th metatarsal and polydactylism) in this country is far in excess of that among 
Europeans, though probably much about the same as in Egypt. 


Upon what hypothesis can these facts be explained? Is there a single cause 
or are there many at work? These are questions which I shall not attempt to 
enter into, but by simply recording my observations I shall hope to stimulate 
others to do the same, for only by accumulating facts can it be hoped that such 
problems will ever be solved. 


Biometrika, Vol. X, Part | Plate | 


(1) (2) 


Samuti, an Ateliotic Dwarf. Subgiant, Height 1:92 metres, with Wife and 
Albinotic Child. 


Etimu, aged 25, an Achondroplasic Dwart. 


Biometrika, Vol. X, Part | Plate Il 


(5) (6) 


Masimosya, aged 19, Gynaecomastos, with other features which were formerly described as those of 
Partial Hermaphroditism. 


(8) 


Microcephalic Infant. [Case of Hydrocele testis included by an over- 
sight of Dr Stannus in the photographs, and 
engraved in consequence, Discovered too late 
to rearrange plates.] 


4 


Plate Ill 


Biometrika, Vol. X, Part | 


‘ATeydaooydvog SurAoys 


(01) 


‘supad GL pase ‘sog 


/ 


Biometrika, Vol. X, Part | Plate IV 


i 
5 


(13) 


Son of Matikwiri, aged 7, a case of Scaphocephaly. 


(14) (15) 


Cases of Umbilical Hernia, Split Hand and Foot in a child, aged 5, 


be 


Biometrika, Vol. X, Part | Plate V 


(16) (17) 


Case showing faint medium depression Blantyre boy, aged 10, with Hare-lip. 
of upper lip. 


(18) (19) 


Young woman with two nipples on left Gobedi, ayed 32, with congenital Humeral 
breast. Micromely. 


i 
. i ' i 
fi ‘ Sat hte . ‘ 
. i 2 : , 
. i 7 
- a is eo ' 
- a —s ~ % 7 ” 
i =, 4 i - 7 
1 3h whe = : - : ue 
- ~ 4 bo ge A? 
- f 4 
7 7 7 | . 
sd : i . 7 
, Fey , et 


Biometrika, Vol. X, Part | Plate VI 


(20) Ndala, Split Hand, left only. (21) Chibisa, aged 30, elongation of all segments of middle finger, 
; and its metacarpal bone, 


Zanes Ss. es EE 


(22) Shortening of the fourth metatarsal bone. 


(24) Shortening of the left great toe. 


C 7 P a 
7 . A - 5 
— 
‘ 
mie 22 in: Jase S >= + : - PK} : : 
¥ 5 e 7 
. ’ a i 
= i j 
ae : (, 
2 ae O24 \, = : 

e 2 
bd ry 


TABLES OF POISSON’S EXPONENTIAL 
BINOMIAL LIMIT. 


By Ee it SOPER, “McA: 


In his treatise, Recherches sur la Probabilité des SJugements, Paris, 1837, 
Poisson* shows that the series of frequencies 


(Dean am Oct n(n =i) pg + + ee (DEK ip + 
2 atin nih Lr 


given by the expanded terms of the binomial 
(p aL qs 
becomes in the limit, when gq is diminished, and n increased, indefinitely, but so 
that nq remains finite and equal to m, the exponential series 
S me m 
e™(l+_m+o——+...+—+4...]3 
2! r! 
and he points out that the terms of this series will give the proportional 
frequencies of the occurrences 
OF pie ee ney 
times, in any sample, of an event, every occurrence of which is equally likely in 
the sample and independent of the other occurrences, and which is of such 
frequency that m events occur in the sample on an average. 


The series is arrived at by “Studentt,” when considering the theoretical 
frequencies in sample drops of a liquid of minute corpuscles supposed distributed 
at random throughout the mass of the liquid. 


The event may also occur in time, each occurrence being supposed to take 
place with equal probability in any finite period taken as the sample, and to act 
independently of the occurrences of all the other events. A physical example, 
which appears by the closeness of the observed to the theoretical frequencies to 


* pp. 205 et seq. 
+ Biometrika, Vol. v. p. 351, ‘‘On the Error of counting with a Haemacytometer,” 


Biometrika x 4 


26 Poissows Eaponential Binomial Limit 


satisfy these conditions, is the number of a-particles discharged per {-minute or 
}-minute interval from a film of polonium *. 

In vital statistics the sample may be an individual or house or community and 
the event an accident or disease and so on. But it must be borne in mind that 
for such series as the above to be applicable the occurrence of one event in the 
sample must not preclude or influence in any way the occurrence of a second. 

The probability of « occurrences, m being the mean number, in a sample, is 

e-™ m*/x | 
and in the tables which follow this is evaluated for m= 0'1, 0°2... to 15:0 and for 


x =0,1, 2... up to such an integer as gives a figure in the sixth place of decimals, 
the number of places tabulated. 


The terms of the series were calculated, each by a fractional operation upon 
the preceding, beginning with the modal term and going both forward and back. 
Thus if m=7'6 the term e~*® x (7'6)'/7 ! was first calculated by tables of logarithms, 
and the succeeding terms were then obtained seriatim by the operations 

16 76 76 

meee ie eI) 
and the preceding ones by the operations 

ag eae etc 

06 5G" AEG ge 


done with a mechanical calculator, first a multiplication and then a division. 


tc., 


Seven places of decimals were thus calculated and the series is checked by the 
total, which differs from unity by the remainder (a figure in the eighth or later 
place of decimals in all the present cases) and the algebraical sum of the errors of 
seventh figure approximations. 


Poisson’s exponential series has been previously calculated to four places of 
decimals by L. von Bortkewitsch+ for values of m from 0:1 to 10:0. 


The present tables give the probability of each number of times of occurrence 
of the event. For the sums of these values, that is, the probability of occur- 
rence of the event, a given number of times or greater, or a given number of times 
or less, reference must be made to a second paper in this issue of Biometrikat, 
where such probabilities are calculated for integral values of m from 1 to 30. 


* See Rutherford and Geiger: ‘‘The Probability Variations in the Distribution of a-Particles,” 
Philosophical Magazine, Vol. xx. p. 700, 1910. See also EK. C. Snow, ‘‘ Note on the Probability Varia- 
tions, &c.,” Vol. xxi. p. 198, 1911, who finds the variance of experiment from theory to be such 
as would occur once in six experiments and once in three experiments respectively of the limited time 
taken, were theory exact. In a note to the first paper H. Bateman gives a proof of the exponential series 
of probabilities arrived at from considerations of this problem. 

+ Das Gesetz der kleinen Zahlen, 1898. A comparison of the table printed therein with the present 
table shows agreement except as to the fourth figure; the nearest fourth figure is not given, in rather 
many instances, in the tables of Bortkewitsch. 

+ Lucy Whitaker, B.Sc. ‘On the Poisson Law of Small Numbers,’ Vol. x. p. 37 et seq. 


H. E. Soper 


TABLE of e™m*/«!: General Term of Poisson's Exponential Expansion 
(“Law of Small Numbers”). 


m 
# | : | a ca : af 
0-1 o2. |, 03 04 | O08 0-6 0-7 0:8 09 | 1-0 
| | | 
0 | 904837 | ‘818731 °740818 | 670320 | °606531 | 548812 | -496585 | -449329 | 406570 | 367879 | 0 
1 | 090484 | "163746 °222245 | -268128  °303265 | -329287 | °347610 | -359463 | °365913  -367879| 1 
2 | 004524 | -016375 °033337 | 053626 | 075816 | -098786 | °121663 | 143785 | "164661 | -183940; 2 
8 | 000151 | -001092 003334 | -007150  °012636 | -019757 | -028388 | 038343 | 049398 -061313| 3 
4, | 000004 | -000055 = =*000250 | ‘000715 *O01580 | -002964 | 004968 | -007669 | °011115 | 015328 | 4 
i) — 000002 *000015 | 000057 | 000158 | -000356 | ‘000696 | -001227 | ‘002001 003066} 5 
6 — — ‘000001 | -000004 000013 | *000036 | *000081 | *000164 | 000800, -000511] 6 
ih — = ~ — | 000001 | 000003 | -000008 | 000019 | *000039 | 000073 | 7 
8 as = = | -- “000001 | *000002 | ‘000004 | “000009 | 8& 
9; ~ = - — _ — _ — — | 000001] 9 
| : - 
x Hes 1°2 1°3 Lp SN 1 1-6 HUSH 1°8 1:9 20 zy 
0 | 332871 | 301194 | -272532 | -246597 | -223130 | .201897 | 182684 | -165299 | °149569 | -135335| 0 
1 | 366158 | -361433 | -354291 | °345236 | °334695 | 3823034 310562 | -297538 | -284180 | -270671| 1 
2 | °201387 | -216860 | *230289 | *241665 | *251021 | .258428 | °263978 | -267784 | °269971 | *270671 | 2 
3 | 073842 | -086744 | 099792 | 112777 | *125510 | -137828 | 149587 | -160671 | 170982 | 180447 | 3 
4 | 020307 | -026023 | -032432 | 039472 | ‘047067 | 055131 | 063575 | 072302 | ‘081216 | 090224 | 4 
5 | 004467 | 006246 | -008432 | 011052 | °014120 | -017642 | ‘0216154 -026029 | 030862 | (036089 | 5 
3 | 000819 | °001249 | ‘001827 | 002579 | °003530 | -004705 | ‘006124 | -007809 | *009773 | ‘012030 | 6 
7 | 7000129 | :000214 | 000339 | °000516 | ‘000756 | ‘001075 | °001487 | -002008 | -002653 | 003437 | 7 
8 | 000018 | 000032 | -000055 | ‘000090 | ‘000142 | -000215 | 000316 | -000452 | -000630 000859 | 8 
9 | -000002 | :000004 | ‘000008 | :000014 | ‘000024 | -000038 | “000060 | -O00090 | -000133 | 000191 | 9 
10 = “000001 | ‘000001 | ‘000002 | ‘000004 | -O00006 | 000010 | -000016 | -000025 | -000038 | 10 
11 — — = — = ‘000001 | ‘000002 | 000003 | “000004 | 000007 | 11 
12 ae ee — — | — {| 000001 | 000001 | 12 
u 21 22 23 24 2:5 2°6 2:7 28 2:9 30 x 
| _ ie 
0 | °122456 | *110803 | *100259 | -090718 | :082085 | ‘074274 067206  -060810 | °055023 049787 | 0 
1 | :257159 | °243767 | 230595 | -217728 | -205212 | -193111 | 181455 | °170268 | 159567 | 149361] J | 
2 | 270016 | 268144 | -265185 | -261268 | 256516 | -251045 | -244964 | 238375 | 231373 224042 | 2 
3 | 189012 | °196639 | *203308 | 209014 | °213763 | °217572 | *220468 | -222484 | *223660 +224042| 3 
4 | 099231 | -108151 | *116902 | *125409 | -133602 | °141422 | 148816 | -155739 | 162154 -168031| 4 
5 | 041677 | -047587 | -053775 | 060196  -066801 | -073539 -080360 | 087214 | 094049 -100819| 4 
G | 014587 | 017448 | 020614 | ‘024078 | 027834 | 031867 | -036162 | -040700 | 045457 -050409 | G 
7 | 004376 | 005484 -006773 | -008255 | 009941 | 011836 | 013948  -016280  -018832 -021604| 7 
8 | 001149 | ‘001508  :001947 | 002477 | 003106 | 003847 | ‘004708  -005698 | ‘006827 -008102!) 8 
9 | 000268 | 000369  °000498 | ‘000660 | -000863 | ‘001111 | 001412 | -001773 | *002200 002701, 9 
10 | :000056 | 000081 | *G09114 | -000158 | :000216  -000289 | °000381 | -000496 | 000638  -000810 | 10 
11 | 000011 | 000016 | -000024 , -000035 | -000049  -000068 | -000094 | -000126 | ‘000168  -000221 | 11 
12 | :000002 | °000003 | -000005 | ‘000007 | 000010 | -000015 | ‘000021 | °000029 | ‘000041  -000055 | 12 
13 — 000001 | 000001 | ‘000001 | >000002 | ‘000003 | -O00004 | *000006 | 000009 = -000013 | 13 
14 — — | = | -- _- ‘QON0OT | *000001 | *OO00001 | 000002  -000003 | 14 
fon — —-| — | — — = = _ — 000001 | 15 
| 


28 Poisson's Exponential Binomial Limit 


TABLE—(continued). 


| 11 | *000287 | -000368 | *000467 | 000587 | -000730 | -00090) | °001102 | -001337 | -001610 | 001925 
12 | ‘000074 | 000098 | °000128 | 000166 | -000213 | -090270 | 000340 | -000423 | -000523  -000642 
3% | ‘000018 | -000024 | *000033 | -000043 , -000057 | -000075 | ‘000097 | 000124 | -000157 | -000197 
14 | -000004 | -000006 | “000008 | -000011 000014 | 000019 | -000026 | -000034 | “000044 | -000056 
| 15 | 000001 | *O00001 | “000002 | -000002 | -000093 | -000005 | 000006 | *CO0009 | -000011 | -000015 


MM 
v ie eer ae — as 
81 32 3-8 ow) ass 36 Pil 3:8 39 40 

0 | 045049 | -040762 | -036883 | -033373 | -030197 | 027324 | -024724 | 022371 | 020242 | -018316| 0 

1 | *139653 | -130439 | 121714 | -113469 | -105691 | -098365 | -091477 | -085009 | -078943 | -073263| 1 

2 | 216461 | -208702 | *200829 | -192898 | -184959 | -177058 | -169233 | “161517 | 153940 | -146525| 2 

3 | 223677 | -222616 | -220912 | -218617 | -215785 | -212469 | 208720 | -204588 | -200122 | -195367| 3 

4 | 173350 | -178093 | 182252 | -185825 | -188812 | -191222 | -193066 | -194359 | -195119 | -195367| 4 

5 | ‘107477 | -113979 | *120286 | -126361 | -132169 | -137680 | 142869 | -147713 | 152193 | 156293) 5 

6 | 055530 -060789 | 066158 | 071604 | -077098 | -082608 | -088102 | 093551 | -098925 | 104196 6 

7 | 024592 | -027789 | -031189 | -034779 | 038549 | -042484 | 046568 | -050785 | -055115 | -059540| 7 

8 | 009529 | 011116 | 012865 | -014781 | -016865 | -019118 | -021538 | -024123 | -026869 029770! 8 

9 | 003282 | -003952 | °004717 | -005584 | -006559 | -007647 | -008854 | 010185 | -011643 | -013231| 9 

10 | -001018 | -001265 | ‘001557 | -001899 | -002296 | -002753 | 003276 | 003870 | 004541 | 005292 | 10 
1 
2Q 


me 


Soe 
s 


5 
16¢)| = : — 000001 -000001 | -000001 | -000001 | -000002 , -000003 | -000004 | 16 
Ly =) ieee ee or ee = a “000001 | 000001 | 17 
al at | ‘49 43 Fy NS 46 Te 48 yo G0 ae 


0 | 016573 | -014996 | °013569 | -012277 | -011109 | -010052 | ‘009095 | :008280 , -007447 | :006738 
1 | 067948 | -062981 | 058345 | °054020 | 049990 | -046238 042748 | °039503 | -036488 | -033690 
2 | *139293 | +132261 | 125441 | +118845 | -112479 | 106348 | -100457 | ‘094807 | -089396 | -084224 
3 | 190368 | +185165 | 179799 *174305 | -168718 | -163068 | °157383 | °151691 | -146014 | -140374 
4 | 195127 | 194424 | -193284 | 191736 | :189808 | -187528 | 184925 | +182029 | *178867 | °175467 
5 | 160004 | +163316 | °166224 | +168728 | 170827 | °172525 | -173830 | °174748 | -175290 | 175467 
G | *109336 | -114321 | +119127 | 7123734 | -128120 | -132270 | 136167 | 1139798 | 143153 | 146223 
7 | (064040 | -068593 | ‘073178 | ‘077775 | 082363 | 086920 | 091426 | 095862 | -100207 | *104445 
032820 | -036011 | 039333 | 042776 | 046329 | 049979 | -053713 | 057517 | ‘061377 | -065278 
9 | -O14951 | -016805 | 018793 °020913 | -023165 | 025545 | 028050 | °030676 | -033416 | -036266 
10 | -006130 | 007058 | 008081 | °009202 | -010424 | -011751 | -013184 | 014724  -016374 | 018133 
11 | -002285 | 002695 | -003159 | -008681 | -004264 | -004914 | -005633 , 006425 007294 | 008242 
12 | 000781 | -000943 | -001132 | 001350 | -001599 | 001884 | :002206 | 002570 -002978 | :003434 
3 | -000246 | -000305 | 000374 | 000457 | 000554 | -000667 | 000798 | -000949 001123 | -001321 
14 | 000072 | 000091 | -000115 | 000144 | -000178 | -000219 | 000268 | 000325 | 000393 | -000472 
15 | 000020 | -000026 | -000033 | -000042 | -000053 000067 | -009084 | -000104 -000128 | 000157 
16 -Q00005 | 000007 | 000009 | ‘000012 | -000015  -000019 | -000025 | 000031 -000039 000049 
17 | 000001 | -000902 | *000002 | *000003 | -000004  -000005 | -000007 | *000009 | *000011 | :000014 


7) 


WOANAAK OHS 


iS |) Ga — | 000001 | *000001 | -000001 | -000001 | -000002  *000002 - -000003 | -000004 
1S) = _— ces — = — — | 000001 | -000001 | *000001 
| | 
ole of Se Nord a4 55 56 a7 eee se ye) i) OO) x 


0 006097 | °005517 | 004992 | -004517 | -004087 | -003698 | 003346 | -003028 | 002739 | 002479 
031093 | ‘028686 | -026455 | °024390 | 022477 | ‘020708 | 019072 | -017560 | 016163 | -014873 
2 | -079288 | 074584 | 070107 | -065852 | -061812 | -057982 | :054355 | -050923 | -047680 | -044618 
3 134790 | 129279 | 123856 | 118533 113323 | *108234 | +103275 | 098452 | ‘093771 | 089235 | 


ES 


Ce e®R > 


H. E. Soper 29 


TA BLE—(continued). 


m 


qr 
Le 
Or 
© 
nn 
aS) 
N 
s 
aT 
ial 
OU 
fo 
~t 
~ 
oa) 
Ne) 
QD 
S 


171857 | *168063 164109 | -160020 | *155819 | *151528 | 147167 | °142755 | 138312 | 133853) 4 
175294 | 174785 173955 | -172821 | 171401) -169711 | 167770 | -165596 | *163208 | *160623 5 
*149000 | °151480 | -153660 | 155539 | °157117 ) °158397 | *159382 | -160076 | 160488 | 160623) 6 
“108557 | °112528 | -116343 | -119987 | °123449 126717 | 129782 | +132635 | °135268 1387677) 7 | 
069205 | 073143 -077077 | -080991 | ‘084871 -088702 | 092470  -096160 | ‘099760 *103258 8 

039216 | ‘042261  -045390 | 048595 | °051866 | -055192 ‘058564 | -061970 | °065398 | 06838 | 9 
“020000 | 021976 024057 | -026241 | °028526 -030908 | °033382 | -035943 | 038585 | 041303, 10 
-009273 | °010388 -011591 | -012882 | °014263  -015735 017298 | 018952 | -020696 | 022529 | 71 
003941 | ‘004502 -005119 | -005797 | °006537  -007343 | -008216 | 009160 | 010175 | ‘011264 | 12 
001546 | 001801 | 002087 | 002408 | °002766  -003163  -003603 | -004087 | 004618 | °005199 | 13 

‘000563 | -000669 | -000790 | -000929  -001087  -001265 | 001467 | -001693 | °001946 | "002228 | 14 
‘000191 | 000232 | -000279 | -000334 000398 000472 | 000557 -000655 | ‘000766 000891 15 
000061 | ‘000075 | -000092 | -000113 | *000137 | -000165 | 000199 | -000237 | *000282 | ‘000334 16 
000018 | -000023 | -000029 | -000086 | ‘000044 | -000054 | ‘000067 | 000081 | ‘000098 | ‘000118 | 17 
“000005 | ‘000007 | ‘000008 | :000011 | ‘000014 | -G00017 | 000021 | -000026 | -000032 | ‘000039 | 18 


000001 | 000002  -000002 | -000003 | -000004 | -000005 | -000006 | -000008 | -000010  -000012 | 19 
= — 000001 000001 | 000001 | -000001  -000002 | -000002 | 000003 | -000004 | 20 
= = —- ; — ar — -= | :Q00001 | *O00001 | *OO0001 | 21 
6'l 6°2 63 Or4 6'5 O°6 oy 6°8 Org 70 we 


002243 | -002029 | -001836 | °001662 | ‘001508 | -001360 | -001231 | -001114 | -001008 | ‘000912 | 0 
013682 | °012582 | -011569 | -010634 | 009772 | ‘008978 | °008247 | 007574 | °006954 | ‘006383 | 1 
041729 | :039006 | :036441 | °034029 | 031760 | 029629 | 027628 | -025751 | -023990 | 022341 | 2 
084848 | ‘080612 | -076527 | °072595 | ‘068814 | 065183 | 061702 | -058368 | -055178 | 052129 | 3 
129393 | °124948 | :120530 | -116151 | °111822 | -107553 | :103351 | (099225 | -095182 | ‘091226 | 4 
"157860 | °154936 | *151868 | -148674 | °145369  +141969 | 138490 | 134946 | -131351 | 127717) 6 
"160491 | °160100 | -159461 | +158585 | °157483 | -156166 | °154648 | 152939 | 151053 | -149003 | 6 
139856 | *141803 | *143515 | 144992  *146234 | -147243 | :148020 | 148569 | °148895 | *149008 | 7 
"106640 | -109897 | :113018 | °115994 | ‘118815 | °121475 | -123967 | °126284 | -128422 | 130377] $& 


072278 | ‘075707 | :079113 | -082484 | 085811 | 089082 | 092286 | -095415 | 098457 | 101405 | 9 
044090 | 046938 | -049841 | -052790 | 055777 058794 | 061832 | 064882 | 067935 | ‘070983 | 10 
024450 | 026456 028545 | :030714 | °032959 | 035276 | -037661 | -040109 | -042614 | “045171 | 11 
012429 | ‘013669 | 014986 | :016381 | °017853 | -019402 | -021028 | -022728 | -024503 | 026350 | 12 
005832 | 006519 | -007263 | ‘008064 | °008926 | -009850 | :010837 | -011889 | -013005 | 014188 | 13 
002541 | ‘002887 003268 003687 | (004144 -004644 | -005186 | 005774 | ‘006410 ‘007094 | 14 
001033 | °0011938  -001373 | -001573 | 001796  -002043 | -002317 | 002618 | -002949 003311 | 14 
‘000394 | :000462 | -000540 | -000629 | 000730 | -000843 | -000970 | 001113 | 001272 *001448 | 10 
000141 | ‘000169 | 000299 | 000237 | ‘000279 *000327 | *000382 | -000445 | ‘000516 | ‘000596 | 17 
000048 | -000058 | 000070 | ‘000084 | ‘000101 000120 | :000142 | ‘000168 | 000198 -000232 | 1S 
‘000015 | *000019 | -000023 | -000028 | -000034  -000042 | -000050 | -000060 | -000072 ‘000085 | 19 
000005 | ‘000006 | -000007 | -000009 | °000011  -000014 | ‘000017 | -000020 | 000025 000030 | 20 
000001 | *090002 | -000002 | -000003 | *000003 | -000004 | 000005 | -000007 | ‘000008  -000010 | 271 

= -— | -000001 | 000001 | -000001 ‘000001 | *000002 | -000002 | -000003  -000003 | 2.2 

= = | -—- —- | — | = — 000001 | 000001 000001 | 22 


q 
q 


30 Poisson's Exponential Binomial Limit 
TABLE—(continued). 
™m 
av ax 
Hi | 7:2 13 Th 1s} v6 at 78 79 8-0 
0 | :000825 | 000747 | -000676 | -000611 | 7000553 | -000500 000453 | 000410 | -000371 | 000335 | 0 
1 | :005858 | ‘005375 | 004931 | 004523 | 004148  -003803 | (003487 | -003196 | 002929 | 002684] 1 | 
2 | 020797 | 7019352 | -018000 | 016736 | ‘015555 | 014453 | (013424 | -012464 | 011569 | 010735 | 2 
3 | 049219 | 046444 | 043799 | 041282 | 038889 036614 | 084455 032407 -030465 | 7028626] 3 
4 | (087364 | -083598 | -079934 | 076372 | (072916 069567 | 066326  -063193 060169 | 057252 | 4 
5 | 124057 | 120382 | +116703 | -113031 | °109375 105742 | -102142 -098581 ‘095067 | 091604} 5 
6 | 146800 | "144458 | 141989 | :139405 | 136718 °133940 | "131082 | -128156 125171 | *122138| 6 
7 | *148897 | *148586 | *148074 | -147371 | °146484 | °145421 | 144191 | -142802 | -141264 | 139587] 7 
8 | 132146 | -183727 | °135118 | +136318 | ‘137329 | -138150 | *138783 | -139232 | -139499 | -139587| 8 
9 | *104249 | *106982 | 109596 | *112084 | *114440 | °116660 | 118737 | -120668 | -122449 | 124077] 9 
10 | ‘074017 | 077027 | 080005 | -082942 | 085830 | ‘088661 | °091427  -094121 | 096735 | -099262 | 10 
11 | ‘047774 050418 | °053094 -055797 | °058521 | °061257 063999 -066740 | 069473 | -072190) 11 
12 | 028267 | -030251 | ‘032299 | -034408 | 036575 | 038796 041066 | -043381 | 045736 | -048127 | 12 
3 | :015438 | 016754 018137 019586 | ‘021101 | -022681 | °024324 -026029 | 027794 | 029616 | 13 
14 | 007829 | 008616 009457 | -010353 | 011304 | -012312 | 013378 -014502 | -015684 | 016924 | 14 
5 | 003706 -004136 004603 -005107 | *005652 006238 ‘006867 -007541 | 008260 | -009026 | 15 
16 | 001644 001861 002100 002362 | 002649 002963 | °003305  -003676 | -004078 | 004513 | 16 
17 | -000687 | -000788 | ‘000902 | °001028 | 7001169 | ‘001325 | 001497 | -001687 | -001895 | 002124 | 17 
18 | -000271 | 000315 | -000366 | -000423 | 000487 | *000559 | 000640 | 000731 | *000832 | 000944 | 18 
19 | -000101 | °000119 | *000141  -000165 | 000192 °000224 | *000259  -000300  *000346 | 000397 | 19 
20 | 000036 -000043 | 000051 | 000061 | 000072 | 000085 | 000100 | -000117 | 000137 | 000159 | 20 
21 | 000012 -000015  *000018 | 000021 | °000026 | -000031.| *000037 000043 | -000051 | -000061 | 21 
22 | 000004 | -000005 -000006 | -000007 | 000009 -000011 | *000013  -000015 | -000018 | -000022 | 22 
23 | 000001, -000002  *000002 | -000002 | “000003 , 000004 | *000004 | -000005 ‘000006 | 000008 | 23 
24 = = ‘000001 | -000001 | 000001  *000001 | 000001 -000002 | -000002 | *00C003 | 2 
25 = 23 = ae = mn — 000001 | 000001 | -000001 | 25 
= | ae raw 
x | 8 8-2 8:3 Sy 85 SiG) ile oa 88 | 89 90 | «x 
0 | 000304 | -000275 | 000249 | *000225 | 000203 | 000184 | -000167 | 000151 | 000136 | -000123 | 0 
1 | -002459 | ‘002252 ‘002063 | ‘001889 | -001729 | -001583 | -001449 | 001326 | 001214 | OOL111) 1 
2 | 009958 -009234 008560 | ‘007933 | 007350 006808 -006304 -005836  -005402 | 004998 | 2 
3 | 026885 | 025239 | 023683 | 022213 | 020826 | 019517 | ‘018283 | 017120 | -016025 | ‘014994 | 3 
4 | 054443 | 051740 049142 046648 | 044255 -041961 039765 | 037664 035656 | -033737 | 4 
5 | 088198 | 084854 081576 | 078368 | ‘075233 | 072174 069192 | -066289 | 063467 | -060727 | 5 
6 | 119067 | "115967 | "112847 | °109716 | "106581 *103449 | -100328 | 097224 | 094143 | 091090 | 6 
7 | 137778 snes 1305 “a105) "129419 | -127094 | °124693 | -129224 | 119696 | °117116| 7 
8 | -139500 | *139244 | -138823 | "138242 | 137508 136626 | 135604 | -134446 | 133161 | 131756 | 8 
9 | 125550 | *126866 | 128025 | "129026 | *129869 | -130554 *131084 | "131459 | "131682 | -1381756| 9 
10 | -101696 | *104031 | 106261 | *108382 | °110388 | -112277  *114043 | -115684 | *117197 | 118580) 10 
11 | :074885 | ‘077550 | ‘080179 | ‘082764 | ‘085300 | -087780 | °090197 | 092547 | °094823 | 097020 | 11 
12 | :050547 | -052993 | -055457 | °057935 | 060421 062909 | -065393 | -067868 | °070327 | °072765 | 12 
13 | 031495 | ‘033426 | 035407 | ‘037435 | °039506  -041617  °043763 , 045941 | -048147 | -050376 | 13 
14 | -018222 | 019578 | 020991 | (022461 | ‘023986 | 025565 -027196 | -028877 | 030608 | 032384 | 14 
15 | -009840 | 010703 | 011615 | °012578 | 013592 | -014657 ‘015773 | 016941 | -018161 | ‘019431 | 15 
16 | -004981 | 005485 -006025 | °006604 | °007221 | -007878 | 008577 | -009318 | 010102 | °010930 | 16 
17 | -002373 | 002646 002942 | 003263 | 003610  -003985 004389 | -004823 | ‘005289 | ‘005786 | 17 
18 | -001068 | 001205  -001356 | °001523 | 001705 | -001904 | 002121 | -002358 | 002615 | -002893 | 18 
19 | -000455 | -000520 | -000593 | °000673 | 000763 | -000862 | 000971 | -001092 | °001225 | 001370) 19 
20 | 000184 ee ia ‘000283 | -000324 | -000371 | 000423 | 000481 | -000545 | ‘000617 | 20 


H. KE. Soper 31 
TABLE—(continued). 
me 
xv rs Bo 
Sl 82 S38 Sh Sb S'6 Ont ss SD | 9:0 
21 | 000071 | ‘000083 | ‘000097 | ‘000113 -000131 | °000152 | ‘O00L75 | *000201 | *000231 | ‘000264 | 27 
22 | -000026 | ‘000031 | -000037 | ‘000043 | -000051 | ‘000059 | 000069 | ‘000081 | 000093 | *000108 | 22 
28 | 000009 | -000011 | ‘000013 | ‘000016 | 000019 | 000022 | *000026  -000031 | -000036 | 000042 | 23 
24 | 000003 | -000004 | 000005 | ‘000006 | -000007 | ‘000008  -009009 | ‘000011 000013 | ‘000016 | 24 
25 | -000001 | 000001 | ‘000002 | :000002 ‘000002 | *000003 | -000003 | °000004 | 000005 | ‘000006 | 25 
| 26 — — —- 000001 | 000001 | *O00001 | “000001 | *QOQ0OL1 | *O00002 | -000002 | 26 
2Y = — — a = — — ‘OOO0O1 | ‘OOO0001 | 27 
Da Oot 9°2 Fs a4 hes) IO Oey 9°8 99 10°0 4B 
| sons. 
0 | 000112 | 000101 | ‘OO0091 | -000083  -000075 | ‘000068 | *OO0061 | *000055 | *000050 | ‘000045 | U 
1 | 001016 | -000930 | ‘000850 | :000778 | ‘000711 | 000650 | ‘000594 | -000543 | 000497 | :000454 I 
2 | 004624 | 004276 | 003954 | 003655 | -003378 | 003121 | (002883  -002663 | 002459 | 002270 g 
3 | 014025 | 013113 | °012256 | 011452 | ‘010696 | °009987 | *009322 | ‘008698 | ‘008114 | ‘007567 3 
4, | 031906 | 030160 | 028496 | -026911 | -025403 | :023969 | ‘022606 | °021311 | 020082 | 0189174 
5 | 058069 | -055494 | -053002 | 050593 | -048266 | 046020 | 043855  °041770 | 039763 | °037833 | 5 
6 | °088072 | -085091 | °082154 | °079262 | ‘076421 | -073632 ‘070899 | °068224 | *065609 | °063055 6 
7 | 114493 | -111834 | *109147 | +106438 | °103714 | *100981 | ‘098246 | -095514 | °092790 | ‘090079 7 
8 | 130236 "128609 | °126883 | °125065 | 123160 | °121178 | °119123 | *117004 | *114827 | *112599 8 
9 | 131683 | °131467 | *131113 | 130623 | -130003 | 129256 | °128388 | *127405 | °126310] 1251109 
10 | *119832 | *120950 | °121935 | *122786 | °123502 | °124086 | °124537 | °124857 | -°125047 | -125110 10 
11 | :099188 | -101158 | *103090 | *104926 | -106661 | *108293 | ‘109819 | *111236  *112542 | 113736 11 
12 | ‘075176 | ‘077555 | ‘079895 | -082192 | -084440 | 086634 | ‘088770 | -090848 | °092847 | 094780 12 
18 | 052628 | :054885 | ‘057156 | 059431 | -061706 | (063976 | ‘066236 | 068481 | ‘070707 | (072908 13 
14 | °034205 | :036067 | °037968  -039904 | ‘041872 | °043869 | °045892 | 047937 | “050000 | °052077 14 
15 | 020751 | 022121 | °023540  *025006 | 026519 | 028076 | ‘029677 | ‘031319 | ‘033000 | ‘034718 15 
16 | °011802 | 012720 | ‘013683  -014691 | -015746 | -016846 | ‘017992  -019183 | °020419 | -021699 16 
17 | (006318 | -006884 | 007485 | 008123 | :008799 | 009513 | -010266 | -011058 | °011891 | ‘012764 17 
18 | 003194 | 003518 | -003867 004242 | 004644 | 005074 | 005532 | ‘006021 | ‘006540 | ‘007091 18 
19 | (001530 | -001704 | 001893 ‘002099 | 002322 | -002563 | °002824 | °003105 | °003408 | 003732 | 19 
20 | :000696 | 000784 | ‘000880 | 000986 | -001103 | -001230 | *001370 | °001522 | *001687 | 001866 | 20 
21 | 000302 | 000343 | ‘000390 000442 -000499 | 000563 | 000633 | 000710 | *000795 | ‘000889 | 21 
22 | 000125 | 000144 | °000165  *O000189 | 000215 | 000245 000279 | *000316 | ‘000358 | -000404 | 22 
23 | 000049 | -000057 | ‘000067 = “000077 000089 | -000102  *000118 | *000135 | 000154 | -000176 | 23 
24 | 000019 | -000022 | ‘000026  -000030 | 000035 | ‘000041 | *000048 | *000055 | ‘000064 | -000073 | 24 
| 25 | ‘000007 | -000008 | 000010 | -O00011 | *000013 | ‘000016 | “000018 | ‘000022  *000025 | -000029 | 25 
| 26 | 000002 | -000003 | *000003 | :000004 | ‘000005 | ‘000006 | *O00007 | *O00008 000010 | 000011 | 26 
27 | "000001 | 000001 | *OOO001 | *O00001 | *000002 | -000002 | *O00002 | "000008 *000004 | 000004 | 27 
28 — —- | = — 000001 | ‘000001 | ‘000001 | 000001 000001, -000001 | 28 
QO — = —- | — ee 000001 p22 
@ 10:1 10°2 L0°3 L104 L0°5 106 L0°7 L0°8 KORY) | 110 7 
0 | -000041 | :000037 | -000034 -000030 -000028 | -000-25 | ‘000023 | -000020 | -000018 | *OO0017 0 
1 | 000415 | *000379 | 000346 | 000317. °000289 | -000264 | -000241 | :000220 | 000201 | ‘000184 7 
2 | 002095 | 001934 | 001784  -001646 | -001518 | :001400 | 001291 | -001190 | -001097 ; 001010 | 2 
3 | 007054 | 006574 | °006125 | -005705 | 005313 | 004946 | 004603 | 004283 | -003984 | *008705 | 3 
1 | | | 


32 Powssovs Exponential Binomial Limit 
TABLE—(continued). 
UD 
x = ine ey ] BL 
10°1 10:2 10°3 10"4 | 10°5 | 10°6 L0°7 10°S 10:9 | 11:0 
| 
—|- ——|——|——— | _—— 
4 | O17811  -016764 | -015773 | 014834 | -013946 | -013107 | -012313 | -011564 | 010856 | -010189 |] 4 
5 | 085979 | °084199 | °032492 | 030855 | *029287 | ‘027786 | °026350 | 024978 | 023667 | 022415 | 5 
6 | °060565 | -058139 | (055777 | 053482 -051252 049089 | 046991 | -044960 | 042995 041095 | 6 
7 | 0873887 | 084716 | 082072 | 079458  °076878 | (074334 | 071830 | 069367 | 066949 064577, 7 
8 | °110326 | *108013 | *105668 | 103296  *100902 | -098493 | ‘096072 -093646 | (091218 | ‘088794 | 8 
9 | °123810 | 122415 | °120931 | -119364 “117720 | *116003 | °114219  -112375 | -110475 °108526; 9 
10 | 125048 | °124863 | °124559 | 124139 +123606  +122963 | -122215 | -121365 | -120418 | 119378 | 70 
17 | 114817 | *115782 | °116633 | °117368 | -117987 | *118492 | -118882 | *119159 | -119323 | -119378 | 71 
12 | 096637 | 098415 | *100110 | °101719 | *103239 | :104667 | *106003 | +107243 | :108386 | 109430 | 12 
13 | ‘075080 | ‘077218 | ‘079318 | -081375 | ‘083385 | °085344 | ‘087248 | -089094 | 090877 | °092595 | 13 
14 | °054165 | °056259 | 058355 | *060450 | -062539 | °064618 |} 066683 | °068730 | ‘070754 | 072753 | 14 
15 | °0B6471 | °038256 | 040071 | 041912 | °043777 | °045663 | 047567 | -049485 | 051415 053352 | 15 
16 | °023022 | 024388 | 025795 | °027243 | °028729 | °030252 | -031810 | 033403 | 035026 ‘036680 | 76 
17 | (013678 | 014633 | °015629 | 016666 | ‘017744 | °018863 | 020022 | -021220 | 022458 | 023734 | 17 
18 | :007675 | *008292 | ‘008943 | -009629 | -010351 | °011108 | -011902 | :012732 ‘013600 | 014504 | 78 
19 | ‘004080 | 004451 | *004848 | *0054%71 | -005720 | -006197 | °006703 | -007237 | 007802 | -008397 | 19 
| 20 | °002060 | -002270  :002497 | -002741 003003 | 003285 | -003586 | 003908 | -004252 | 004618 | 20 
21 | 000991 | °001103 | *001225 | ‘001357 | *001502 | °001658 | -001827 | °002010 | 002207 | ‘002419 | 21 
22 | 000455 | -000511 | 000573 | 000642 | 000717 | 000799 | :000889 | -000987 | -001093 | °001210 | 22 
23 | -000200 | -000227 | -000257 | 000290 -000327 | -000368 | 000413 | 000463 | -000518 | ‘000578 | 23 
24 | 000084 -000096 | -000110 | 000126 -000143 | 000163 | -000184 000208 | -000235  -000265 | 2 
25 | 000034 | 000039 | *000045 | *000052 , 000060 | *000069 | ‘000079 | -000090 | 000103 | ‘000117 | 25 
26 000013 | 000015 *000018 | -000021 | 000024 -000028 | -000032 000037 | -000043 | 000049 | 26 
27 | 000005 | 000006 | 000007 | *000008 | 000009 | :000011 | *000013 | :000015 | 000017 | *000020 | 27 
28 000002 | -000002 | *0000038 | *000003 | -000004 | *000004 | :000005 | *000006 | -000007 | -000008 | 28 
29 | 000001 | ‘000001 | -000001 | °000001 | 000001 | 000002 | *000002 | -000002 | -000003 | 000003 | 29 
3O — _ — a — 000001 | ‘000001 | -000001 | *000001 | ‘000001 | 30 
a let ED 113 114 TAG 5 = eto Mie 11°8 11:9 12°0 iz. 
0 | 000015 | 000014 | ‘000012 | ‘000011 | ‘000010 | *000009 | °000008 | °000008 | *000007 ‘000006 |} 0 
Z | -000168 | °000153 | 000140 | 000128 | ‘000116 | *000106 | ‘000097 | -000089 | ‘000081 | ‘000074 | 7 
2 | 000931 | -000858 | 000790 | 000727 | 000670 | ‘000617 | 000568 | 000522 | 000481 | 000442 | 2 
3 | 003445 | -003202 | 002976 | 002764 | ‘002568 | 002385 | -002214 | -002055 | 001907 | ‘001770 | 3 
4 | 009559 | 008965 | -008406 | 007879 | 007382 | 006915 | 006476 | 006062 | -005674 | 005309 | 4 
5 | °021221 | 020082 | -018997 | 017963 016979 | °016043 | 015153 | ‘014307 | -013504 | ‘012741 | 5 
G | ‘039259 | °037487 | -035778 | 7034130 | 032544 | -031017 | °029549 | 028137 | -026782 | 025481 6 
7 | 062253 | -059979 | -057755 | 055584 | 053465 | 051400 | 049388 | -047432 | 045530 | 043682 | 7 
8 | 086376 | °083970 | 081579 | -079206 | ‘076856 | 074529 | 072231 | -069962 | 067725 | ‘065523 | 8 
9 | 106531 | -104496 | 102427 | -100328 | 098204 | 096060 | -093900 | °091728 | 089548 | ‘087364 | 9 
10 | “118249 | -117036 | 115743 | 114374 | +112935 | *111430 | *109863 | 108239 "106562 | °104837 | 10 
11 | 119324 | +119164 | °118899 | °118533 | -118068 | *117508 | °116854 | *116110 | -115281 | 114368 | 11 
12 | *110375 | 111220 | +111964 | *112607 | *113149 | °113591 | +113933 | -114175 | 114320 | *114363 | 72 
13 | °094243 | 095820 | -097322 | 098747 | -100093 | °101358 | -102539 | -103636 | -104647 | 105570 | 13 
14 | 074721 | (076656 | -078553 | -080409 | -082219 | 083982 | 085694 | :087350 | -088950 | 090489 | 14 
15 | 055294 | 057236 | 059177 | 061110 | -063035 | -064946 | 066841 | -068716 070567 | 072891 | 15 
16 | -038360 | ‘040065 | -041793 | 043541 | 045306 047086 | 048877 | ‘050678 | 052484 | -054293 | 16 
17 | 025047 | -026396 | -027780 | -029198 | 030648 | °032129 | :033639 | 035176 | 036739 | ‘0388325 | 17 
18 | 015446 | 016424 | -017440 | -018492 | 019581 | -020706 | 021865 | ‘023060 | -024288 | 025550 | 18 
19 | -009023 | 009682 | -010372 | ‘011095 | 011852 | :012641 | 013465 | -014322 | -015212 | ‘016137 | 19 


Se ee ee ee 


ay 


ee ee ee ee 


Po pain hs: 


H. E. Soper 


TA BLE—(continued). 


Vit 
Lv = = = =F = — 
111 11-2 113 [14 HN, | LTO 9) eat te? 
| | | | 
20 | 005008 | -005422 | 005860 | -006324 -006815 | 007332 | ‘007877 
21 | :002647 | 002892 -003153 | :003433  -003732 | ‘004050 | 004388 | 
22 | 001336 | -001472 001620 001779 | -001941 | ‘002136 | -00233 
23 | 000645 | -000717 | -000796 | 000882 | -000975  -001077  -001187 
24 | °000298 | -000335 | 000375  -000419 | -000467 | 000521 | -000579 
25 | 000132 | -000150 000169 | 000191 | 000215 | 000242 | -000271 
26 | °000057 | 000065 000074 -000084 | 000095 | 000108 000122 
| 27 | 000023 | 000027 | 000031 —-000035 “000041 "000046 | -000053 
28 | ‘000009 | -OVDOL1 | 000012 -000014 | ‘000017 | -000019 | 000022 
| 29 | -Q00004 “O00004 000005 *OO0006 | *OO0007 | *O00008 | -~OO0009 
| 30 | -000001 | -000002 ‘000002 | -000002 | -000003 | -000003 | -000003 
| el *YODVO1 | -000001 | *000001 | 000001 | *OO0001 | ‘000001 | 
OY | — -- — — -— 
| F = 
i || StF 12:2 12°3 124 io 12°6 12°7 
| r | “aanel| 
0 000006 | -000005 | *000005 | 000004 -000004 | *000003 000008 | 
1 000067  -000061 | *000056 | 000051, 000047. *000042  -000039 
2 000407 000374 | 000344 -000317 000291  -000268 | -000246 
3 001641 | -001522 | °001412 | -001309 | -001213 | 001124 | -001042 
4 | 004966 | -004643 | °004841 | 004057 | -003791 | 003541 | 003307 
5 | 012017 | -011330 | 010679 | -010062 | -009477 | ‘008924 | 008400 | 
6 | 024233 | -023037 | °021892 | 020794 | -019744 | (018740 | -017781 
7 | 041889 | 040151 | 038467 | 036836 | -035258 | ‘033733 | -032259 | 
8 | 063358 | -061230 | °059142 | -057095 | 055091 | °053129 | 051212 
9 | -085181 | -083000 | ‘080828 | -078665 | 076515 074381 | (072266, 
10 | *103069 | 101261 | °099418 | ‘097544 | -095644  °093720  -091777 
11 | 113876 | °112308 | 111168 | +109959 | -108686 | 107352 | °105961 
12 | *114321-| -114180 | 113947 | -113624 | -113215 | °112720 | -:112142 
13 | °106406 | -107153 | °107811 | -108380 | -108860 | °109251 | *109554 
14 | 091965 | :093376 094720 095994 | -097197 | ‘098326 | -099381 
15 | 074185 | ‘075946 | °077670 | 079355 | -080997 | °082594 | 084143 
16 | 056103 | -057909 | °059709 /-061500 | (063279 ‘065043 | 066788 | 
17 | :039932 | -041558 | °043201 | 044859 | -046529 ‘048208 | -049895 | 
18 | ‘026843 | 028167 | ‘029521 | -030903 | -032312  °033746 | °035204 
19 | -017095 | -018086 | ‘019111 | -020168 | -021258 | °022379 | 023531 
20 | -010342 | -011033 | ‘011753 | -012504 | -013286 | 014099 | 014942 
21 | 005959 | -006409 | ‘006884 | -007383 | 007908 | ‘008459 | -009036 | 
22 | -003278 | 003554 | 003849 | 004162 | 004493 | ‘004845 | -005216 
23 | 001724 | -001885 | ‘002058 | -002244 | -002442 | ‘002654 | 002880 
24 | 000869 | 000958 | -001055 001159 | 001272 | *001393 001524 | 
25 | 000421 | -000468 | 000519 | -000575 | 000636 | ‘000702 | -000774 
26 | :000196 | :000219 | -000246 | -000274 | -000306 | °000340 | -000378 | 
27 | -000088 | -000099 | ‘000112 | 000126 | -000142 | ‘000159 | 000178 | 
28 | -000038 | 000043 -000049 | -000056 | -000063 | ‘000071 | -000081 | 
29 | 000016 -000018 000021 | -000024 | -000027 | -000031 | -000035 
30 | 000006 | -000007 000009 -000010 | -000011 | (000013 | ‘000015 
31 | -000002 | -000003  -000003 000004 | 000005 | -000005 | 000006 
32 | 000001 | -000001 000001 -000002 ‘000002 000002 | 000002 | 
Bo = | = — “000001 | 000001 | “000001 | *O00001 
BY ee ee = == i) = ae 


Biometrika x 


ao 
ints ———— rs 
it} || rile) | BO) 
008450 | -009051 | -009682. 20 
004748 | 005129 -005533. 21 
| -002547 | 002774 | 003018 | 22 
001307 | 001435 | 001575. 23 
000642 °000712 | ‘000787 24 
“000303 | 000339 | ‘000378 25 
000138 | -000155  *OOO1L74 26 
“000060 | *O00068 | 000078 | 27 
‘000025 | -000029 | ‘000083 | “5 
© 000010 000012 | -O00014 | 79 
“000004 | 000005 -000005 | 3 
‘000002 | -000002 - *000002 3 
*000D01L | 000001 | “000001 | 37 
I 

12'8 12°9 13:0 xv 
000003 | 000002 | 000002} 0 
000035 | -000032 | *000029 | 1 
“000226  *000208 | ‘000191 2 
000965 | ‘000894 | *000828 | 6 
003088 | °002882 | *002690 | 4 
007905 | 007436 | 006994 | 5 
016864 | -015988 | °015153 | 6 
030837 | -029464 | ‘028141 | 7 
049339 | °047511 | 045730 | 8 
‘O70171 068100 -066054) 9 
089819 | -087849 | ‘085870 | LO 
104516 | *103023 | *101483 | 11 
-111484 | -110749 | :109940 | 12 
-109769 | -109897 | (109940 | 13 
100360 | -101263 | "102087 |) 14 
085641 | ‘087086 | 088475 15 
(068513 | ‘070213 -071886 16 
‘051586 | 053279 | -054972 | 17 
036683 | 038183 039702 | 18 
024713 | (025925 -027164 | 19 
015816 | 016721 °017657 | 20 
009640 | 010272  °010930 ) 2 
‘005609 | -006023 | ‘006459 | 22 
003122 | :003378 ‘003651 | 25 
001665 | ‘001816 | 001977 | 24 
000852 | ‘000937 | °001028 | 25 
000420 000465 | 000514 | 26 
“000199 | -000222 | 000248 | 27 
000091 , 000102 ‘000115 | 28 
000040. -000046 | *000052 | 29 
000017 | -000020 | *000022 30 
“000007 ‘000008 | 000009 31 
“000003  -000003 | ‘000004 52 
“000001 | -OO0001 | -000002 | 3: 
SS = 0000014173 


5 


34 Poissons Eaponential Binonial Limit 


TABLE—(continued). 


m 


| | | \ 
13°2 LO258 Wy lovin A elo olen elo G 137. | 13:8 13°9 1L‘0 | 
| 


> St Ss INO 


© © % Ww & 7 


9 | *000568 | -000626 | ‘000689 | :000758 


000002 ‘000002 *000002  -000002 | 000001 ‘000001 *000001 | *000001 000001 ‘000001 
000027 | ‘000024 | *000022  -000020 | -000019 | ‘000017 000015 | 000014 | -000013 | -000012 
000175 ‘00016 | *000148 000136 -000125 | ‘000115 *000105 | -000097 | -000089 | *000081 | 
‘000766 | 000709 | °000657  -000608 000562 | -000520 000481 | -000445 000411 | 000380 | 
‘002510 | *002341 | 002183 | -002035 -001897 | ‘001768 °001648 | 001535 -001429 | ‘001331 | 
"006575 006180 °005807 | 005455 005123 ‘004810 004514 | -004236 ‘003974 | 003727 
"014356 013596 | *012872 | -012183 | 011526 | 010902 *010308 009743 | 009206 | :008696 
‘026867 | 025639 | "024458 | -023322 022230 | ‘021181 | 020173 | 019207 | -018280 | 017392 
"043994 -042304 -040661  -039064 -037512 | -036007 084547 -033132 031762 | 030435 
‘064036 | °062046  *060088 | 058161 | -056269 | -054410 | 052588 | -050802 | -049054 | 047344 
| 083887 | ‘081901 | ‘079916 | -077936 075963 | -073998 | -072046 | ‘070107  -068185 | -066282 | 10 
‘099901 098281 | 096626 | -094940 | 093227 -091489 -089730 | -087953 ‘086162 | -084359 | 11 
ee HOBLOO fine a aon ue 104880 | *103687 102441 | -101146 | ‘099804 | -098418 | 1? 
| "109898 | *109773 | *109566 | -109279  -108914 | *108473 | °107957 | -107370 | *106713 | *105989 | Lo 

"102833 | *103500 “104087 | 104595 *105024 | -105373  +105644 105836 | 105951 | *105989 | 14 
_ 089807 | °091080 | °092291 | -093439 | -094522 | 095539 , 096488 | ‘097369 | 098181 | 098923 | 15 


| 073530 075141 | ‘076717 | -078255 079753 | -081208 | -082618 | 083981 | -085295 | 086558 


WH OHMWAAK Cs WHS 


a 
D 


‘056661 | °058345 | -060019 | 061683  -063333 | -064966 ‘066580 | -068173 | 069741 | 071283 | 17 
041237 | *042786 | °044348 | 045920 | -047500 | -049086 | (050675 | °052266 | 053856 | 055442 | 18 


039400 | :040852 | 19 
027383 | ‘028597 | 20 
018125 | 019064 | 1 
011452 | :012132 | 22 


028432 029725 | 031043 | -032385 | 033750 | 035135  °036539  -037962 
| 018623 | 019619 | 020644 | -021698 | -022781 | °023892  -025030 | -026193 
‘O11617 | °012332 | ‘013074 | 013846  -014645 | -015473 016329 | -017213 
‘006917 | °007399 | 007904 | -008433  -008987 | 009565 -010168 | -010797 
003940 | -004246 | ‘004571 | 004913 -005275 | ‘005656 006057 , ‘006478 | :006921 | :007385 | 25 | 
“002151 | 002336 | ‘002533 | :002743 | 002967 | 003205 | 003457 | -003725 | 004008 | 004308 | 24 | 
001127 | ‘001233 | 001348 | 001470  -001602 | 001744 | 001895 | ‘002056 | 002229 | -002412 
000832 -000912 | 000998 | -001091 | ‘001191 | :001299 | 26 | 
000275 | ‘000306 | °000340 | 000376  -000416 | 000459 | -000507 | 000558 | (000613 | :000674 | 27 
‘000129 | -000144 | ‘000161 | 000180  -000201 | :000223 | :000248 | 000275 ‘000305 | 000337 | 28 | 
000058 | “OO0066 ‘000074 | 000083 | -000093 | 000105 | -000117 | -000131 -000146 | :000163 | 29 | 
000025 | -000029 | 000033 | -000037 | 000042 | 000047 ‘000053 “000060  -000068 | ‘000076 | 30 | 
000011  -000012 | :000014 | -000016 | 000018 | °000021 | -000024 | °000027  -000030 | :000034 | 31 | 
"000004 | 000005 | -000006 | -000007 | ‘000008 | -000009 | 000010 | -000012 | 000013 | ‘000015 | 22 | 
“000002 | -000002  -000002 | -000003 | ‘000003 | 000004 | 000004 | *000005 | ‘000006 | ‘000006 3S | 

| 000001 | ‘OO0001 | ‘000001 | -000001 | -000001 | -000001 | -000002 | ‘000002  -O00002 | 000003 of 
D —- |; — a — | :000001 | ‘000001 | ‘000001 | ‘OO000I | ‘000001 | 35 


141 | Ife | 14°38 4 | 145 146 et 14-8 | 14-9 15°0 “ 
| 4 
0 |} *Q00001 | 000001 | 000001 , 000001 000001 | = — | —- —— a ei =. | © 5 
1 | 000011 | -000010 | 000009 | -000008 — -000007 | 000007 | ‘000006 | ‘000006 | -000005 | *000005 1 : 
2 | 000075 | ‘000069 | -O00063 » 000058 | *000053 | 000049 | -000045 000041 | ‘000088 | *0000384 | 2 
3 | *0G0352 000325 | 000300 | ‘000277  :000256 | ‘000237 | 000219 | -000202 | ‘000186 | ‘000172 | 3 
4 | 001239 -001153  -001073 | 000999  -000929 | -O00864 | 000803  -000747  -000694 | 000645 | 4 
5 | 003494  -003275 -003070 | -002876 -002694  -002523  -002362 002211 | ‘002069 | °0019386 | 3 | g 
6 | 008212 -007752 | ‘007316 | “006902 -006510 | ‘006139 | 005787 °005454 | 005138 | (004839; 6 ¢ 
7 | (016541 | -015726 | 014946 -014199  -013486 | -012804 | -012152. -011530 -010937 | 010370) 7 | ; 
& | 029153 | 027913 | 026715 | 025559 | -024443 | -023367 | -022330 | 021331 | -020370 | 019444 | 8 2 
9 | 045673 | 044040 | 042447 | -040894 | 039380 | ‘037907 | (086472 | "035078 | 033723 | ‘032407 9 
10 | 064399 | -062537 | 060700 | -058887 057101 | 055343 | 053614 | -051915 050247 | 048611 10 
11 | :082547 | 080730 | -078910 | 077089 | -075270 | 073456 | ‘071648  :069850  -068062 | ‘066287 | 17 
\ | 


7 
Ds 
er 


H. E. Soper 


TABLE—(continued). 


m 


35 


Lyl | 142 Lyd | Lh 4 1d | 146 TH 148 19 15-0 
s | = | Rae ae - ; a 
§-096993 095530 | -094034 | -092507  -090951 089371 ‘087769 | 086148 | 084510 | 082859 12 
105200 | -104349 | 103437 -102469 | -101446 | *100371  -099247 | 098076 | -096862 | 095607 13 
105951 *105839 | 105654 -105396 | -105069 | -104672 *104209 103681 | 103089 | 102436 14 
§ -099594 | -100195 | 100723 -101181 | *101567 101881  *102125 | -102298 | 102402 | -102436 15 
‘087768 088923 | -090021 091063. -092045 092967 -093827 | -094626 095361 | 096034 16 
072795 | -074277 | 075724 077135 | ‘078509 | -079842 | -081133 | 082380 | -083581 | 084736 17 
(057023 *058596 | -060158 -061708 063243. ‘064761 -066259 067735 069187 | 070613 18 
042317 043793 | 045277  -046768 | 048264 -049763  -051263 | 052762 | 054257 | 055747 19 
029834 031093 | 032373 ‘033673 034992 -036327 037678 -039044 | 040422 | ‘041810 20 
020031 | -021025 | 022045 -023090 | ‘024161  -025256 026375 027517 028680 | -029865 27 | 
012838 -013570 | 014329 | 015114 -015924 -016761  -017623 | 018511 | 019424 | 020362 22 
007870 008378 | ‘008909 | 009462 | 010039 ‘010640 -011264 011911 | -012584 | 013280 23 
(004624 -004957 | 005308 | 005677 | 006065 006472 -006899 “007345 °007812 | 008300 24 
002608 -002816 | 003036 | 003270 ‘003518 -003780  -004057  °004348  -004656 | 004980 25 
001414 001538 | 001670 “001811 -001962 “002123 -002294 002475 | -002668 | -002873 | 26 
“000739 ‘000809 | -000884 -000966. ‘001054 -001148 | -001249 “001357 001473 | -001596 27 
000372 -000410 000452 | 000497 | 000546 000598 -000656 -000717 “000784 | 000855 28 
‘000181 -000201 | -000223 | -000247  -000273 000301 -000332 000366 “000403 | 000442 29 
‘000085 “000095 | “000106 “000118 | °000132 “000147 | -000163 -000181 | -000200 | 000221 40 
000039 “000044 000049 | -000055 000062 | 000069 -000077 000086 -000096 | -000107 4 
“000017 {000019 | 000022 | 000025 | -000028 | -000032 -000035 | -000040 | “000045 000050. 32 
000007 -000008 | 000009 000011 | -000012 “000014 -000016 000018  -000020 | -000023 33 
§ 000003 -000003 | -000004 -000005 -000005 | 000006 000007 000008 | 000009 | 000010 34 
000001 “000001 | -000002 000002 000002 “000002 “000003 000003 000004 | 000004 35 
— 900001 | 000001 -000001 | -000001 000001 -000001 | 000001 | «000002 | -000002 36 
=e aa eae ap as ae = 000001 | -000001 ‘000001 37 


ON THE POISSON LAW OF SMALL NUMBERS. 


By LUCY WHITAKER, BSc. 


PART I. THEORY AND APPLICATION TO CELL-FREQUENCIES. 


(1) Introductory. 


Let p denote the probability of the happening of a certain event A, and 
q = 1-—p, the probability of its failure in one trial. Then it is well known that 
the distribution of the frequencies of occurrence n, n — 1, n — 2... times in a series 
N of n trials is given by the terms of the point binomial 


VG @ eo a/) MMH rMn Raa hone yoAusdcbbooodote. (1). 
The fitting of point-binomials plotted on an elementary base c to observed 
frequency distributions has been discussed by Pearson*, and he has indicated that, 
if c be unknown, the problem can be solved in terms of the three moment coefficients 
Hy x, M4 required to find c, p and n. In actual practice but few cases of frequency 
ean be found which are describable in terms of a point-binomial, and of these few 
a considerable section have n negative, p greater than unity and q negative; thus 


defying at present interpretation, however well they may serve as an analytical 
expression of the frequency. . 


The hypothesis made in deducing the binomial (p+ q)” as a description of 
frequency is clearly that each trial shall be absolutely independent of those which 
precede it. In this respect it may be said that binomial frequencies belong to the 
teetotum class of chances, and not to those of card-drawings, when each drawing 
is unreplaced. In the latter case the “contributory cause groups are not inde- 
pendent,” and our series corresponds to the hypergeometrical rather than to the 
binomial type of progression ft. 

Using the customary notation 8, = p;°/"s°, Bo = M4/M, the binomial is determined 
from : 

n=2/{8—B.+B}, c=ov6—28,432, 
Pq =$ 8B — B+ B,)/(6 — 28, + 38;) 


* «Skew Variation in Homogeneous Material,” Phil. Trans. Vol. 186, A, p. 347, 1895. 
+ Phil. Trans. Vol. 186, A, p. 381, 1895. 


Lucy WHITAKER 37 


In order that n should be positive, it is needful that 
3—-B,+ 8, =4(6—28,+ 28), 


should be positive. If this is satisfied clearly c will be real because 8, is always 
positive. Further then 
1 6— 28,4 28, 

Ree) eS Goa eae) 
is always less than a quarter and p and q will therefore be real. If the reader 
will turn to Rhind’s diagram, Biometrika, Vol. vit. p. 131, he will see that the line 
3—6.+ 8,=0 cuts off all curves of Types III, IV, V and VI, and includes a 
portion only of Type I, with a part of its U and J varieties. The binomial 
description of frequency, therefore, is not—considering our experience of frequency 
distributions—likely to be of very universal application. 


(2) Further Linutations. 


Now let us still further limit our binomial by supposing : 


(i) that the unit of grouping of the observed frequencies corresponds to the 
actual binomial base unit c and (ii) that the first of the observed frequencies 
corresponds to the term Np” of the binomial*. 


In this case the mean m of the observed frequency measured from the first 
term of the frequency will be equal to the nq of the binomial and the standard 
deviation of the observed distribution will be equal to Vnpg. We have thus: 

Doo KG — LC, TN —O) sc cn cda ete nscrwes: (111) 
and n and q will both be negative, if m be less than o% The condition for a 
positive binomial is therefore that o be less than 4/m. 

(3) Probable errors of the constants of a Binonual Frequency. 

It is desirable to find the probable errors of p and n as determined by these 
formulae. We have: 

pa = nq, M2 = MPs 
buy’ = Gqdn+ nog, Sp. = pon + ngdp + npog, 
assuming deviations may be represented by differentials. 
Hence, since dp = — dq: 
Ou. —(p — q) du, = Gon and pdm,’ — bu. = nqdq. 
Square each of these results, sum for all samples and divide by the number of 
samples, and we have: 
Oe at qy Cie 2(p—-q) C5 Oi) City iy Gon 
2 oot Vi ,=Nn20"%o,2. 
Oe HD fay! 270 4% p, a Hy fg 

* The exact nature of these limitations must be fully appreciated. The best fitting binomial to 
a given frequency distribution will usually be far from one in which the first term of the binomial 
corresponds to the first observed frequency. The modes of the binomial and the observed frequency 


will closely correspond, but the ‘‘tails” of the binomial may be quite insignificant and correspond to no 
observed frequencies. 


38 On the Poisson Law of Small Numbers 


Now (ai is the standard deviation of variations in ~. and therefore 
oO", = (fa — pe’)/N. 
Similarly oy! is the standard deviation of variations in the mean and therefore 


Lo Heo sures t f 1 ween 
o*,/=Ms/N. Lastly the product o,,0,/7,,,,' measures the correlation betwe 


deviations in #, and yp,’ and is known to be p,/N*. 
Thus we have: 


1 
eer ee Bo pay q)? fa = 2(p—@) pst, 


wePay = + {a — Pa? + Pe — pps}. 


Butt fy = npg {1 +3(n—2) pg}, | 


een Arrs bh 00 (iv). 
= mpg(p-G, He = npg | 
Whence after some purely algebraical reductions we deduce: 
n p ad 
r= >= Pe el aN | erm Aan (bet |) | adodoccdcous00s ' 
° VNY (1- 4) - me A) aes 2(1- *) ©) 
we pasl E noe Sp . 
Op) = Oy vv 2 ae ip Cae aaa (vi). 


Formulae (v) and (vi) are very important; they enable us to obtain the 
probable errors for x and p when a binomial limited in the present manner is 
fitted to a frequency distribution. 


We see at once, that as n grows large and q grows small 
0, =o, approaches the limit V2/N, 


or the probable error, 67449 /2/N, of p and q is finite. But o® being finite op 
becomes infinitely great, or the probable error of n indefinitely large. Thus when 
the n of the binomial is very large, g being very small, the probable error of its 
determination is so great that its actual value is not capable of being found 
accurately. Again, suppose V embraced 200 observations, the probable error of q 
would be of the order ‘07; if NV corresponded to only eighteen observations, then 
the probable error of q would be of the order ‘22. It is clearly wholly impossible 


* Biometrika, Vol. 11. ‘‘On the Probable errors of Frequency Consiants,” see p. 275 (iv), p. 276 (vii), 
and p. 279 (xii). 
_ + Phil. Trans. Vol. 186, A, p. 347, 1895. 
+ There is no difficulty in obtaining the probable errors of n and p from the more general values 
in (ii). In this case 


t= 0No%g +o Be 208, 78,7B,p,» 


Cp=oq= es Ve? ,$42B,"0°2, — 2B, og %R,7,2,° 


The values of TB.» Fg, and "g,p, tor different values of 8, and B, have been tabled by Rhind, Biometrika, 
Vol, vit. pp. 136—141., 


Lucy WHITAKER 39 


from series of observations even of the order 200, much less of order 18, to assert 
that g is or is not really a “small quantity.” Thus the observed value of q corre- 
sponding to a population of extremely small q might easily show g=°15 to °50!. 


(4) Poisson—Law of Small Numbers. 


A last limitation of the point-binomial is made by supposing the mean m = nq 
to remain finite, but g to be indefinitely small. We write : 


We 


N(p+ qr =N(Q-g4qr=N-gt (1+ 4) 


me me 


=N(1-q)? (14+)! nearly 


Ne aa (1 +m So + ei + a) ; 

Here the successive terms give the frequency of occurrence of 0, 1, 2, 3... 
successes on the basis of each success not being prejudiced by what has previously 
occurred. This is the Law of Small Numbers. It was first published by Poisson 
in 1837*. It was adopted later by Bortkewitsch, who published a small treatise 
expanding by illustrations Poisson’s work+. The same series was deduced later 
by “Student” in ignorance of both Poisson and Bortkewitsch’s papers, when 
dealing with the counts made with a haemacytometert. 


The mean is at m from the first group, the other moments as “Student” has 
shewn § are: 
f,.=™M, pg=mM, py =3m?4+m. 
Hence B,=1/m, B,.-—3=1/m. 


When the mean value is large, 8,, 8, and the higher §’s approach the values 
given by the Gaussian curve. 


Clearly the Poisson-Exponential formula contains only the single constant 
m = p, and its probable error is therefore °674490//N = 67449 m This will, 
if V be reasonably large and m not too big, be a small or at any rate a finite 
quantity (i.e. not like o, for g very small). Hence it might be supposed, although 
erroneously, that the Poisson-Exponential formula was capable of great accuracy 
in addition to its great simplicity. But this is to neglect the fundamental 
assumptions on which it is based, namely: 


(i) that the data actually correspond to a binomial, 
(ii) that in that binomial g is small and n large. 


Clearly (i) shows us that, if we can find the binomial, it will actually be closer 
to the observed frequency than Poisson’s merely approximate formula. 


* Recherches sur la Probabilité des Jugements. Paris, 1837, pp. 205 et seq. 

+ Das Gesetz der kleinen Zahlen, Leipzig, 1898. 

£ “On the Error of Counting with a Haemacytometer,” Biometrika, Vol. v, pp, 351—5, 1907. 
§ They may be deduced at once from (iv). 


40 On the Poisson Law of Small Numbers 


Secondly (11) can only be justified as an assumption by actually ascertaining 
the form of the binomial from the data and testing whether n is large and q small 
and positive. It appears absurd to base our formula on an approximation to 
a binomial of a particular kind when, on testing in the actual problem, such a 
binomial does not describe the results. As a merely empirical formula, the 
Poisson-Exponential of course can be tested by the usual processes for measuring 
goodness of fit, but no such test nor any discussion of the probable errors of their 
results have been provided by Bortkewitsch himself nor by Mortara, who has 
followed recently his lines in a work to be considered Jater. Asa matter of fact in 
the cases dealt with by Bortkewitsch, by Mortara and by “Student,” n will be found 
almost as frequently small and negative as large and positive, and q takes a great 
variety of values large and negative and large and positive, as well as small 
and positive. Thus the initial assumptions made from which the “law of small 
numbers” is deduced are by no means justified on the material to which it has so 
far been applied. 


(5) Application of the Law of Small Numbers to determine the Probable Errors 
of Small Frequencies. Given a distribution of frequency for a population NW let ny 
be the frequency in the cell of the sth row and éth column of a contingency table 
(or if we drop t, 7, would stand for the frequency of any class). Then if we take 
a random sample of WV individuals from this population, the chance that an indivi- 


dual is taken out of the 7, cell is fis/|N, and that it is not is 1— 7: Therefore if 


the original population be so large that the withdrawal of an individual does not 
affect the next draw, the frequency of individuals in M random samples of WV will 
be given by the terms of the binomial : 


M {( 1") + ke 


Now, if jig/N be very small, and WN large this will approximate to the 

Poisson series : 
Me (1 +m+ stat =) : 

where m == x V. But 7y/N will approximately be the mean proportion of the 
whole in the st cell of the sample itself =ny/N, or m= ny. Thus if in any cell of 
a contingency table, or in any sub-class of a frequency whatsoever, we have a 
frequency ny small as compared to the population V, then in sampling, this small 
frequency will have a distribution approximating to the Poisson Law, and tending 
as my, becomes larger to approach the Gaussian distribution*. It would appear, 


* Such approach is usually asswned when we speak of 


67449 Alf nae ( e “) 


as the probable error of the frequency n,. But such a ‘probable error’’ has really no meaning if 7, 
be very small and the exponentiai law be applied. 


Lucy WHITAKER 41 


therefore, that the Poisson Law of Small Numbers should be applied in order to 
deal with the errors of random sampling in any small frequency, and an appeal 
should not be made—as is usually the case—to Sheppard’s Tables on the assump- 
tion that the frequency is Gaussian. 


The following Table I illustrates the results obtained (a) from the Binomial, 
(b) from the Poisson-Exponential and (c) from the normal curve on the two 
hypotheses that (1) the frequency is 10 in the 1000 and (6) is 30 in the 1000. 
But here a word must be said as to which Gaussian is to be compared with the 
Binomial or the Poisson-Exponential. The usual method of fitting a Gaussian is 
to give it the same mean and standard-deviation as the material to which we are 
fitting it. For example, we should compare the Poisson exponential with a Gaussian 
at mean m and with standard-deviation ym, or the point binomial with mean ng 


TABLE IL. 


Comparison of Binomial, Poisson-Exponential and Gaussian for cell-frequency 
variations in samples for case of 10 and 30 in a total population of 1000 


Percentage Frequency 


| 
10 in 1000 30 in 1000 
; = | a a =| 
Binomial | Reese Gaussian Binomial | feo Gaussian 
0 00004. | = =-OU0005 ‘00132 19 | -00848 | ‘(00894 = ‘01100 
1 ‘00044. (00045 =|) 00327 20 ‘01287 ‘01341 01553 CO 
2 ‘00020 ‘00227 | ‘007385 Bil ‘01857 sO1OIG 9) <O2718" 9) 
3 00739 ‘00757 ‘01491 22 ‘02556 02613 =| © :02792 =| 
4 01861 | -01892 | -02736 28) ‘03362 08408 | *08544 | 
5 | -03745 03783 | -04539 | 2, | -04233 | -04260 | -04373 
6 ‘06274 ‘06306 ‘06806 25 05110 | 05112 =| «=-05198 
or 7 “08999 “09080 ‘09224 26 05927 =| ‘05898 ‘05970 
S 8 "11282 *11260 “11300 27 | ~ 06613 ‘06553 ‘06625 
Es 9 “12561 Sb abet 28 ‘O7107 07021 | ‘07104 
oo | 29 ‘07367 | ‘07263 =| ‘07360 
oH | 
8 |10| -12574 ‘12511 "12526 
E | 30 | -07375 ‘07263 ‘07367 
eS 11 114381 11374 | 11334 
12 (09516 | °09478 | :09271 | 31 07137 ‘07029 ‘07126 
13 ‘07305 ‘07291 ‘06854 | 32 “06684 06590 =| ~=-06659 
14 "05202 705208 | 04580 | 33 ‘06064 05991 =| -06013 
15 03454 | 03472 = :02°767 34 05334 05286 =| 05246 
16 °02148 ‘02170 ‘O1511 Bo. 04553 04531 04423 
wy *01256 ‘01276 ‘00746 36 ‘03775 ‘08776 =| *03602 
18 ‘00693 ‘00709 00333 =| 37 ‘03042 ‘(03061 | *02835 
19 ‘00362 ‘00373 00134 38 *02384 02417 *02156 
20 ‘00179 | = ‘00187 “00049 389 ‘01819 ‘01859 01584 
21 ‘00085 ‘00089 ‘00016 40 01351 ‘01394 ‘01125 
22 ‘00038 00040 = 00005 41 ‘00979 (01020 ‘00771 
ASS 00016 | -00018 ~~ -0001 42 ‘00691 ‘00729 “00511 


Biometrika x 6 


42 On the Poisson Law of Small Numbers 


and standard-deviation “npg. These will, however, not be identical standard 
deviations as p is not truly unity. In ordinary practice, in testing for example the 
30 in 1000 frequency, we should put the centre of our Gaussian at our 30 group, 
and use a standard deviation = V30 (1—30/1000) = /30 x ‘97 = 5°39444 to enter 
the table of the probability integral. This is, of course, the Gaussian we obtain 
by the method of least squares, but to assume that it is “the best” is to argue in 
a circle, because we then take least squares as a test of what is best*. It is 
not the Gaussian which is directly reached by proceeding either to a limit of the 
Binomial or to the Exponential, for example, by applying Stirling’s Theorem. It 
will be seen by examining Table II that the Gaussian curve develops out of the 
exponential by a mode at the point midway between the two equal terms, rather 
than by a mode at the mean, which coincides with the centre of the second of 
them. If we apply Stirling’s Theorem to the term+ 


in 


N La nr gr 
|n —r |r eve 
of the binomial NV (p+q)”" it becomes 
No g-kr- mgt 4 (p- 9-9), 


U, e 


V 2a Vinpq 
i.e. the ordinate of a Gaussian curve of Standard Deviation Vnpq and mean at 
ng—4(p—q). These give for the Poisson-Exponential the Gaussian with standard- 
deviation ,/m and mean m—- }. The above type of curve which gives frequencies 
by coordinates and not by areas has been termed by Sheppard a ‘spurious curve 
of frequency’; at the same time it is the method by which Laplace and Poisson 
first reached the normal curve, and the real point at issue is whether we shall get 
better approximations to the discontinuous frequencies of the binomials by using 
Gaussian ordinates than by using the areas of a Gaussian curve. At the same 
time it has been shewnt that if a Gaussian curve gives a series of frequencies by 
its areas, then if its standard-deviation be o?, a spurious Gaussian frequency curve 
with standard deviation given by o,?= o°? + +;h’, h being the sub-range, will closely 
give the frequencies by its ordinates. It seems probable therefore that the 
Gaussian curve with mean at ng—4(p—q) and standard deviation Vnpq— 5 
will more closely represent the binomial for cell frequency variation by its areas, 


* There is a further flaw in this treatment—the Gaussian is continuous, the Binomial and the 
Poisson-Exponential are not. If t, be the rth term of either of the latter series, we ought really to 


make 
r+1-m N = a 9 
So” [je- | ——eé Fash alee 
r—m N2ro 


a minimum by the conditions du/dm=du/dc=0. No complete solution of this problem has hitherto 
been determined. 


Die 


+ The final form for wu, may be obtained by neglecting the terms in n in the formula given by 
Pearson, Phil. Trans. Vol. 186, A, p. 347, footnote. i 
+ Biometrika, Vol. ur. p. 311, 


* 
7 
rz,* 
et) 
- 


a 


t - 7 
2 “7 : 
7 = 
> 


ic 


a 
. - 
a 
= 
a 


Frequencies {total 1000) 


150 


100 


10) 


Biometrika, Vol. X, Part I. 


150 


LO 


oO 


Catisstarn. 


Plate VII 


35 


: Gatisstan. 
ied 


Frequencies (total 1000) 


Biometrika, Vol. X, Part I. Plate VII 


10\0- aes Mee 
=| 
1 4 —_ 
350, 150 | 1 aoe 
— 5]O 
Mp9 1 
300 100; 
| 
250 Tegal 501 
200; OG 5 10 
De 
1501 
Caussian 
4 \ 
“TIE 7.2/0 | 
100) 100) J , we 
| pa r] 
| } | b 15 20 25 30 35 
ete 
| oak yp Gausstan. 
50 5@ oF Pht = 30 i > 
| 14 - 
| | 
Oa Daa ae 6 OF 8 DION MSD IGT BIOs 


| 35 40 


Ww 


Lucy WHITAKER 43 


than if we apply the ordinary process of mean ng, standard deviation Vnpq, and 
Sheppard’s table for areas to the frequencies. It will be noted that this amounts 
to using Sheppard’s correction on the crude second-moment and slightly shifting 
the central ordinate towards the side of greater frequency. This is the Gaussian 
curve used in Table I. 


The object of the present section of our work is to indicate how far it is 
legitimate to use the Poisson-Exponential up to cell frequencies of the order 30 
in a population of about 1000* and how far we then reach a state of affairs, which 
for practical purposes may be described by ordinary tables of the Gaussian. It 
will be seen from Table I that the Poisson-Exponential even for ny =10 and 30 is 
not extremely divergent from the Binomial. 


In Plate VII the transition of the exponential histograms of frequency towards 
the Gaussian form is indicated for cell-frequency = 1, 5, 10, 15, 20, 25 and 30; in 
the cases of 10 and 30 the corresponding Gaussian curves are drawn. 

It will be seen that with due caution the Poisson-Exponential may be reason- 
ably used up to frequencies of about 30 in the 1000, and that after that it would 
be fairly satisfactory to use the areas of the Gaussian curve as provided in the usual 
tables. 


(6) In order to table the results of the Poisson-Exponential for easy use, it 
seemed desirable to turn them into percentages of excess and defect. For example 
take the distribution for a frequency 5. It is: 


Per cent. of Cases in which: 


0 006,737,945 a defect of 5 occurs : 0674 
1 033,689,725 3 4, oy more Pe: 4043 
Dy, "084,224,310 > 3 or more vor: 12°465 , 
3 140,373,850 P 2 or more an 26°503 
4, 175,467,310 as 1 or more ee 44-049 
5 175,467,310 the true value ar 17°547 
6 146,222,755 an excess of 1 or more ~. S 38°404 
a 104,444,825 ms 2 or more . 23°782 
8 065,278,015 - 3 or more ‘ 13337 
9 036,265,564 * 4 or more s 6809 
10 (018,132,782 P 5 or more *. 3183 
Tt ‘008,242,178 _ 6 or more ' 1370 
12 003,434,238 " 7 or more a 0°545 
13 001,320,860 _ 8 or more . 0202 


* Of course in the Poisson-Exponential itself the total frequency plays no part; it is only useful in 


testing the validity of the approximation. 


6—2 


44 On the Poisson Law of Small Numbers 


Thus we see that if the true value of the frequency be 5 for the average sample, 
it will only lie outside the tange 1 to 10 in 674 + 1:370 = 2044 cases per cent., or 
the odds are 49 to 1 that the value found will be from 1 to 10. 


On the other hand it will lie outside the range 2 to 8 in 4043+ 6°809 =10°852 °/, 
of cases, or once in about 9 trials the frequency will lie outside this range. Or, 
again, once in about every four trials (25°8°/,) the result will fall outside the 
range 3 to 7. 


On the other hand if we write «= 5 (1 — 005) = 223047, we have — 4°5 
and +55 as the deviations from a mean 5 of all beyond 0°5 and above 105, 
giving w/o =—2:0175 and + 24658 respectively. These cut off tail areas of 
02181 and ‘00684, respectively. Thus in 2°865—not 2°044—per cent. of cases 
we should assert that the frequency would he outside the range 1 to 10, or the 
odds that it would lie inside this range are now only about 34 to 1, not 49 to 1. 
Calculated from the Gaussian the frequencies outside ranges 2 to 8 and 3 to 7 
correspond to 10°1°/, and 26:2°/, of the trials instead of 10°9°/, and 25°8°/,. If 
we take for the standard-deviation of our Gaussian Vnpgq — yy = 2°21171, we find 
that the odds in the first case are still only 35 to 1, but the percentages in the 
other two cases are 11°3 and 25'8. 


It will be clear that near the centre of the curve—especially when we equalise 
the excess and defect of the Gaussian by taking equal ranges on both sides—it 
does not give bad percentages of frequency, but that it does not lend itself to 
the accurate determination of the range for reasonable working odds such as 
50 to 1. 


It will be noted that the total area in excess and defect of 2 and more 
= 23°782 + 26°503 = 50285, or corresponds very nearly to the “probable error.” 
Actually the Gaussians with standard deviations of 2°23047 and 2:21171 give 
probable errors of 1:504 and 1-492 respectively, so that the Gaussian with 1°5 as 
the probable error is very nearly accurate. 


Table II gives the Poisson-Exponential; it will enable the reader to appreciate 
the range of probable variation in small frequencies. Thus we realise that in 
37°/, of cases in which the true frequency is 1, the cell will be found empty ; 
in 13°5 per cent. of cases it will be empty when the actual frequency is 2, and in 
5 °/, of cases when the frequency is 8 and in 1°8 °/, when the frequency is 4. These 
results indicate how rash it is to assume that a sample 4-fold table with one zero 
quadrant signifies perfect dependence or association in the attributes of the 
material sampled. The second line below gives the percentages of cases that 0 
would appear in a cell when the actual number to be expected is that in the first 
line calculated from Table IT on the usual theory of a priort probabilities : 


Actual ih 1 4 5 6 | ren aes | 9 &over 


116 | 0-43 0-16 | 0-06 0°02 


| Percentage 


63°21 eee || 8°55 | 3°15 


0°01 


Lucy WHITAKER 45 


TABLE II. 
Table of Poisson-Exponential for Cell Frequencies 1 to 30. 


Cell Frequencies 


Per cent occurrence of values differing by x or 


Per cent. occurrence of values differing by « or more in excess 


Fy 1 2 8 J 5 6 ¢3 8 9 10 
22 
al 
20 
19 
18 
a 17 
elen 16 
lp ae 
= | 
Soi 
sia! 12 | 
oo ii | 
‘6 10 005 
i 9 ae 012 ‘050 
ce 8 034 123 yi | 
5 if 091 302 623 | 1:033 
S 6 248 7730) |) e tea7d |) 222123) |e 25925 
5 ——=——|  -674 | 1°735 -| 2:964 | 4:238 | 5-496 | 6:708 | 
4 1°832 | 4:043 | 6-197 | 8-177 | 9-963 | 11°569 | 13-014 
3B ———_| 4:979 | 9:158 | 12°465 | 15:120 | 17-299 | 19-124 | 20-678 | 22-022 | 
2 |__| 13-534 | 19:915 | 23-810 | 26-503 | 28°506 | 30-071 | 31°337 | 32-390 | 33-282 | 
1 | 36-788 | 40-601 | 42°319 | 43°347 | 44-049 | 44:568 | 44°971 | 45-296 | 45°565 | 45°793 
Actual] 36°788 | 27-067 | 22°404 | 19°537 | 17-547 | 16-062 | 14°900 | 13-959 | 13°176 | 12°511 
1 | 26:424 | 32°332 | 35-277 | 37:116 | 38-404 | 39°370 | 40-129 | 40-745 | 41-259 | 41°696 
2 8030 | 14-288 | 18°474 | 21-487 | 23-782 | 25-602 | 27°091 | 28:°338 | 29-401 | 30°323 
B 1°899 | 5-265 | 8:392 | 11-067 | 13°337 | 15-276 | 16-950 | 18°411 | 19°699 | 20°845 
4 366 | 1°656 | 3°351 | 5113 | 6:809 | 8-392 | 9852 | 11°192 | 12-422 | 13-554 
5 ‘059 453 | 1191 2°136 | 3:183 | 4:262 | 5:335 | 6°380 | 7°385 | 8°346 
6 ‘008 ‘110 “380 813 | 1°370 | 2:009 | 2°700 | 3°418 | 4:146 | 4:875 
i 001 024 ‘110 284 +545 883 | 1:281 1726 | 2°403 | 2°705 
8 000, -005 029 092 | 202 363 “572 ‘823 | 1:110 | 1°428 
9 =e ‘001 ‘007 027 ‘070 140 ‘Q4] 372 | 532 ‘719 
10 = 000 002 008 | 023 ‘051 096 159 | 242 346 
11 is = “O00 “002 ‘007 ‘O18 ‘036 ‘065 “105 “160 
a | 12 as = : ‘001 002 006 013 025 | 044 | ‘071 
E 13 — ; = _ “000,001 002 ‘005 “009 ‘O17 “030 
Se eR hs = i000 ‘001 002 | 003.) 007 ‘O13 
15 == weet) t= See S25 ‘000 ‘001 ‘001 | 002 ‘006 
EB | 16 = ae a = 000 ‘000 ‘001 002 
eel 17% = = — a = = - “000 001 
18 _ | = = ‘001 
19 = — — —_— | — — -—— a — “O00 
20 = a = 224 (ies = = = = — 
21 = = aa == hie = = a a —_ 
22 = = -_ = = = = = = = 
23 = = _ = = _ = == = — 
2 imate ee os — = os ee ee le = _ 
Ne = —_ = ao Se dig ee See = 
26 — = — — = a= = = = oe 
at | — —— == = = 


46 On the Poisson Law of Small Numbers 
TABLE I1—(continued). 
Cell Frequencies 
av 11 12 | 18 Uy 15 16 iif 18 19 20 
| | 
Ly ed 22 | 
hes 21 
= 20 a 
a 19 sik eS 
=p 18 — 
a | 17 = — ‘000 “000 
es | 16 | | — | »:000'| 000) | COM mmoue 
ea | 15 | | 000 000 | ‘001 002 | 004 | ‘007 
Be | Ly ‘000 | -001 002 | 004 | ‘008 | ‘015 | -026 
eB ole ite ‘000 | 001 004 | 009 | 018 | 032] -052 | -078 
eed He 001 003 | 009 | 021 040 | -067 | ‘104 | -151 209 
8 || any 002 008 | 022 | -047.| -086 | +138 | -206 | ~-289"\") -3Sia/Nae500 
es | 10 020 052 105 }_ 181 279 | 401 543 | -706 | 886 | 1-081 
a | 129 121 229 | 374 | 553 | -763 | 1:000 | 1:60 | 17538 | 1-832 | 2-139 
peril 8 ‘492 | -760 | 1-073 | 1:423 | 1:800 | 2-199 | 2-612 | 3-037 | 3-467 | 3-901 
-£] 7 | 1510 | 2-034 | 2589 | 3-162 | 3-745 | 4-330] 4-912 | 5-489 | 6-056 | 6-613 
ae 5 | 3-752 | 4582 | 5-403 | 6-206 | 6:985 | 7:740 | 8-467 | 9°167 | 9-840 | 10-486 
is 5 | 7861 | 8-950 | 9-976 | 10-940 | 11-846 | 12-699 | 13°502 | 14-260 | 14-975 | 15-651 
8 4 | 14319 | 15-503 | 16°581 | 17°568 | 18-475 | 19-312 | 20-087 | 20-808 | 21-479 | 22-107 
2 3 | 23-198 | 24-239 | 25-168 | 26-004 | 26-761 | 27-451 | 28-084 | 28°665 | 29-203 | 29-703 
2 2 | 34-051 | 34-723 | 35°317 | 35-846 | 36-322 | 36-753 | 37-146 | 37-505 | 37-836 | 38-142 
1 | 45-989 | 46-150 | 46°31] | 46-445 | 46°565 | 46-674 | 46°774 | 46-865 | 46-948 | 47-026 
| | 
|Actual, 11-938 | 11-437 | 10-994 | 10°599 | 10-244 | 9-922 | 9-629 | 9°360 | 9-112 | 8:884 
“A 1 | 42-073 | 42-404 | 42-695 | 42-956 | 43-191 | 43-404 | 43-597 | 43-776 | 43-939 | 44-091 
8 2 | 31:130 | 31-846 | 32:486 | 33-064 | 33-588 | 34-066 | 34°503 | 34-909 | 35-283 | 35-630 
z 3 | 21-871 | 22-798 | 23°639 | 24-408 | 25-114 | 25-765 | 26-367 | 26-928 | 27-451 | 27-939 
= 4 | 14596 15-559 | 16-450 | 17-280 | 18-053 | 18-776 | 19-451 | 20-088 | 20-686 | 21-251 
2 5 | 9-261 | 10°129 | 10-953 | 11°736 | 12°478 | 13-184 | 13-852 | 14-491 | 15-099 | 15-677 
= 6 | 5593 | 6-297 | 6-983 | 7°650 | 8-297 | 8-923 | 9-526 | 10-111 | 10-675 | 11-219 
s | v7 | 3219 | 3742 | 4266 | 4-791 | 5-311 | 5-825 | 6-399 | 6-826 | 7:313 | 7-789 
|= | 8 | 1769 | 2-198 | 2-501 | 2°884 | 3-275 | 3-669 | 4-064 | 4-461 | 4-856 | 5-248 
© 929 | 1160 | 1:407 | 1-671 ; 1°947 | 9-932 | 9-593 | 9:824 || 3107 \\iardae 
| > 10 467 | -607 762 | . 933° | 1117 | 1-312 | 1-516 | 1732" |S kobaeeoaiep 
foes ana) Ble 225 305 | 396 | 502 | 619 | -746 | -882'| 1-030 | 1:185 | 1:348 
aa | 12 104 | +148 | = -201 261 331 ‘All ‘497 | +595 | -699 | - -809 
58 | 13 047 | 069 | 097 | -131 172 | -219 | -272 | 333 | 400 | “478 
Seca, 020} ‘031 046 | -063-| 086 |" 1140) “14d | 1625) eos mmmoes 
ea | Is 008 ‘O14 ‘O21 ‘030 042 ‘057 074 ‘096 121 149 
295 | 16 003 | -006 | 009 | -013 | 020 | 028 | 036 | 050 | 064 | -081 
cali | ll 001 002 | 004 | 006 | ‘009 | 014] 017 025 | 033 | 042 
i 18 001 000 | 002 | -002 | 004 | 006 | 008 | ‘O12 | “O17, |) =e-0z2 
2 19 000 | 000-001 001 002 | 003 | 003 | -006 | 008] O11 
8 20 ae ae 001 ‘000 | -001 ‘002 | 002 | 003 | -004 | -005 
5 Of |, was = 000 | 000 | -000 | 001 001 | 002 | -002 | -003 
E Bea = 000 | 000-001 001 | ‘001 
So eee dine = = = ze = an = 000 | 000 | 001 
° 24 = = = = = = = = = 000 
= 25 at em | ees = = = = = = == 
8 26 = = ke i oi 
R a7 _ — — — — — — 
aw 28 Es at, ae ee Sn (ure — = = = 


Lucy WHITAKER 


TABLE Il—(continued). 


Cell Frequencies 


4 


= 


v 21 22 23 24 25 26 or 28 29 30 
Al , 
° Op | ae es ee = = = = = = = ‘000 
5 21 = ral -_ oe = =e ‘000 000 ‘000 ‘001 
8 20 —_ 2 — — 000 000 ‘O01 001 ‘001 “002 
| 19 ae aa -000 -000 ‘001 ‘001 002 | = -008 004 006 
2p 18 -000 000 001 001 002 004 006-009 012 ‘O17 
|-Ba | 17 ‘001 002 003 ‘005 008 ‘O11 016 | 023 ‘O31 ‘O41 
some 16 003 ‘006 ‘010 O15 022 ‘031 043 | 056 073 092 
| alee eee 012 020 -030 043 O59 ‘078 “102 “129 160 195 
An Mew 039 058 ‘081 “109 142 “180 224 273 328 387 | 
ES 1 ‘ll “150 ‘198 252 314 384 “460 543 632 727 | 
Se | 19 277 355 “443 540 ‘647 762 884 | 1:014 1-151 1-293 | 
pon lett “625 763 912 | 1:072 | 1°240 | 1-417 | 1-601 | 1°791 1987 | 2-187 
een | 10 1-290 | 1°512 | 1°743 | 1:983 | 2:229 | 2-482 | 2-739 | 3:000 | 3-263 | 3:528 
ze 9 2-455 | 2°778 | 3°107 | 3:440 | 3°775 | 4111 | 4:446 | 4°781 | 5-114 | 5-444 
ee 8 4-336 | 4°769 | 5-200 | 5-626 | 6:048 | 6-463 | 6°872 | 7-274 | 7-669 | 8:057 
zg Hi 7°157 | 7°689 | 8-208 | 8-713 | 9-204 | 9-682 | 10-147 | 10-599 | 11-038 | 11°465 
38 6 | 11°107 | 11:704 | 12-277 | 12:827 | 13°358 | 13°867 | 14°357 | 14°830 | 15-285 | 15°724 
aa 5 | 16292 | 16-900 | 17-477 | 18-025 | 18°549 19-048 | 19°525 | 19°981 | 20-417 | 20-836 
= 4 | 22°696 | 23-250 | 23°771 | 24:263 | 24°730 | 25-172 | 25°591 | 25-990 | 26-371 | 26-734 
© 3 | 30-168 | 30:603 | 31:010 | 31°391 | 31°753 | 32-094 | 32-416 | 32-721 | 33-011 | 33-287 
2 2 | 38-426 | 38-691 | 38:938 | 39°168 | 39-387 | 39°593 | 39°786 | 39:970 | 40143 | 40-308 
1 | 47-097 | 47:164 | 47-226 | 47-283 | 47-340 | 47-392 | 47-440 | 47-486 | 47-530 | 47-572 
Actual] 8°671 | 8:473 | 8-288 | 8-115 | 7:°952 | 7:799 | 7°654 | 7-517 7°387 | 7:264 
2 1 | 44-232 | 44:363 | 44-485 | 44-603 | 44°708 | 44°810 | 44:906 | 44:997 -| 45-083 | 45°165 
8 2 | 35-955 | 36-258 | 36-542 | 36-812 | 37-062 | 37-299 | 37-525 | 37-739 | 37-942 | 38-135 
2 3 | 28-397 | 28:828 | 29-235 | 29-620 | 29-982 | 30:326 | 30°653 | 30-965 | 31-262 | 31-546 
5 4 | 21-785 | 22-290 | 22-770 | 23-227 | 23-660 | 24-074 | 24:469 | 24°847 | 25-208 | 25-555 
a 5 | 16-230 | 16°758 | 17-264 | 17-748 | 18-211 | 18-655 | 19-083 | 19-493 | 19-888 | 20-269 
# 6 | 11-744 | 12-251 | 12-740 | 13-213 | 13-669 | 14:110 | 14°538 | 14°951 | 15-351 | 15°738 
a ” 8254 | 8709 | 9°153 | 9:585 | 10-007 | 10-418 | 10°819 | 11°210 | 11-591 11-962 | 
B 8 5-637 | 6:022 | 6-402 | 6-777 | 7:146 | 7:509 | 7:866 | 8-218 | 8°562 | 8-901 
& 9 3°742 | 4:052 | 4:362 | 4:670 | 4:978 | 5-284 | 5°588 | 5-890 | 6-188 | 6-484 | 
ES 10 2-415 | 2-654 | 2°895 | 3:188 | 3°385 | 3-632 | 3°880 | 4:129 | 4:377. 4°625 
a 11 1517 | 1:692 | 1°873 | 2-057 | 2-246 | 2-438 | 2-633 | 2°831 | 3-030 | 3-230 | 
tai | 812 927 | 1°051 | 1°18] 1315 | 1°456 | 1:599 | 1:°747 | 1:899 | 2-053 | 2-210 
isso 13 “552 637 727 “821 92] 1025 | 1°134 | 1:247 | 1°362 | 1:481 
HS 14 320 376 437 “500 570 643 720 ‘801 885 | 973 
4 15 ‘181 217 256 298 “345 394 ‘448 “504 564 626 
2 | 16 ‘100 -122 147 ‘173 204. 237 272 311 352 395 
‘ee | 17 054 -067 082 ‘098 ‘118 139 162 “188 ‘215 | +245 
ig 18 -028 036 ‘045 ‘055 ‘067 ‘080 095 ‘111 129 | +149 
° 19 O15 ‘019 024 -030 ‘037 ‘045 054. ‘065 076 | 089 
S) 20 007 010 013 016 ‘020 ‘025 ‘03 ‘037 044 052 
B 21 004 -005 ‘007 009 ‘Oll | 014 ‘O17 021 025 ‘030 
= 22 002 ‘002 004 ‘005 006-007 009 ‘O11 O14 ‘O17 
8 23 ‘001 ‘001 002 ‘003 003-004 ‘005 006 ‘007 ‘010 
S Qh 001 001 ‘001 002 002 002 003 003 004 006 
= 25 ‘000 000 001 ‘001 ‘001 001 ‘001 002 002 003 
8 26 =s ee -000 ‘000 000 ‘001 001 ‘001 ‘O01 002 — 
x or es = : ‘000 -000 000 -000 ‘001 
a 28 Bs = = -_ _ : | 


“O00 


48 On the Poisson Law of Small Numbers 


PART II. CRITICISMS OF PREVIOUS APPLICATIONS OF 
POISSON’S LAW OF SMALL NUMBERS. 


(7) We now turn to the illustrations which various authors have given of 
the Law of Small Numbers. 


“Student's” Cases. We take first the series given by “Student” in his memoir 
on counting with a Haemacytometer*. They are of special importance because 
the series at first appear of fairly adequate size, namely consisting of 400 
individuals, and further we should anticipate that the Law of Small Numbers 
would hold in his cases. He obtains better fits with the binomial than with the 
exponential but, as he remarks, he has one more constant at his disposal. On the 
other hand, if the exponential be a true approximation, the binomial ought to come 
out with a large n and a small but positive g. “Student” finds for his four 
series : 

L400 x (111893 — 1893)-2™, 
Il. 400 x (97051 + 02949 624, 
III. 400 x (1:0889 — -0889)-2"™, 
IV. 400 x (9525 + 0475989", 

{I. and IV. may, perhaps, be held fairly to satisfy the conditions, although it 
is not certain if 46 is to be considered a large n or ‘05 a very small q. 


I. and III. fail to satisfy the conditions at all, unless the probable errors of q¢ 
and n are such that g might really be a small positive quantity and n really large 
and positive. The following are the values for the four series of n and q and their 
probable errors : 

I g=— 1893 +0647, n=— 3°6054+4 1:2209. 
Il. qg=+°0295 +0457, n= 46°2084 + 71°7373. 
III. g=— 0889 + 0534, nm = — 202473 + 12°1165. 
IV. qg=+ 0475 +0452, n= 985263 + 93°7494. 


Now while these results are very satisfactory for II. and IV., they are not 
wholly conclusive for I. and III. We can approach the matter from another 
standpoint; the probable error of g for p=1 is 


il . 
67449 Ta V2 = 67449 x 0707 


in “Student’s” cases. Thus the deviation of q from q a very small quantity is for 
I. 2°68 times the 8. D., and for III. 1:26 times the 8. D. Since g may be either 
positive or negative, we may reasonably apply the probability tables and the odds 
against deviations occurring as great as these are in one trial about 250 to 1 and 
9 to 1 respectively. Hence in four trials we should still have large odds against 
their combined appearance. 

* Biometrika, Vol. v. p. 356. 


Lucy WHITAKER 49 


We have said that the results for Il. and IV. are fairly satisfactory, ie. we 
mean that they are consistent with g being small and positive and n being large ; 
but of course they are also consistent with g being negative and n being small and 
negative. 

It will be obvious from these results for “Student’s” data that it is extremely 
difficult to test the legitimacy of the bypothesis on which the “Law of Small 
Numbers” is based. In none of the cases dealt with by Bortkewitsch, much less 
in those dealt with by Mortara, are the populations (V) anything like as extensive 
as those considered by “Student.” But populations of even 400 give, as we see, too 
large values of the probable errors of g and w for us to be certain of our conclusions. 


(8) Bortkewitsch’s Cases. Taking Bortkewitsch next, he deals with the 
following cases : 

I. Suicides of Children in Prussia for 25 years: (a) Boys, (b) Girls, 25 cases. 

II. Suicides of Women in eight German States for 14 years: 112 cases or 
8 subseries of 14. 

III. Accidental Deaths in 11 Trade Societics in 9 years: 99 cases, or 11 sub- 
series of 9, 


IV. Deaths from the Kick of a Horse in 14 Prussian Army Corps for 20 years: 
280, or, as Bortkewitsch, 200 cases. 


It will be noted at once that Bortkewitsch’s populations (1) are far too small 
for any effective determination of the legitimacy of his application of Poisson’s 
formula to his data. 


We take his cases in order: 
I. (a) Suicides of Boys. 


TABLE IIL. 
| Number of Suicides _... ON LZ AS We 5 6" | 7andiover 
| | | 
| -——-—-— cae at Se } 
| Number of Years So 4 | 8 | Des) 4s 108 1 | 0 


The binomial is: 
25 [1:2033 — -2033]-°™, 
Mean 1:9600 and yw, = 3:2584. 
We have g= — 2033 +°2421, n= — 96425 + 109416. 


If y were really zero its probable error would be +1908. Clearly 25 cases are 
wholly inadequate to test the legitimacy of applying the Poisson-Exponential to the 
frequency*. But to what extent is the reader made conscious by Bortkewitsch 
that his cases fail entirely to demonstrate the legitimacy of applying his hypotheses ? 

* The x2 for the binomial is 2°379 and for the exponential 2°836, showing a somewhat better 
result for the binomial. 

Biometrika x 7 


50 On the Poisson Law of Small Numbers 


I. (6) Suicides of Girls. 


TABLE IV. E 
Rees ae | | 
Number of Suicides... 0 vi | 2 3 | 
| — ——— 
Number of Years a 15) | oan | 0) 


The binomial is: : 
25 ['7418 + °2582 7, 


Mean = 4400 and yu, = 3264. 
We find g ='2582 +1012, n=1°7041 +°7850. 
As in the case of the boys’ suicides, if g were practically zero its probable error 


would be + ‘1908, and there is nothing in this result again to justify us in asserting 
that q is indefinitely small and n indefinitely large. 


Actually we have: 


TABLE V. 
Number of Suicides per Year. 
0 1 aye mans 
x 7 | 

Actual ... ee 15 9 1 ure 
Bortkewitsch 16-1 71 [cg | ees 
Binomial (a)... 15:0 8°9 11 — 
Binomial (0) 15:2 8°7 11 — 


(a) is the binomial considered above, (b) is the binomial obtained by taking 
n a whole number = 2, and g= mean/2 = ‘22, we. 25 (78 + °22). 

It is clear that either (a) or (b) gives better results than the Poisson-Expo- 
nential. Applying the test of goodness to fit, we have 

x? = ‘007 for the binomial (qa), 
x? = 610 for Bortkewitsch’s solution. 
Both give P > ‘60 but the first is much better than the second. 
If both boys and girls are taken together, we find the binomial 
25 (9333 + (0667). 

This is the nearest approach to a small q and big 7 we have so far found—ze. the 
nearest approach so far to an exponential, but it is reached by a process, «.e. that of 
adding together two series of entirely different means and variabilities in a manner 
which cannot be justified, for Bortkewitsch’s hypothesis depends essentially on the 


homogeneity of his material. Even here the fit of the point binomial is slightly ; 
better than that of the exponential. 


Lucy WHITAKER 51 


II. Suicides of Women in hight German States. Bortkewitsch gives the 
following table : 


TABLE VI. 
Number of Suicides of Women per Year 
State - - Totals 
Ore iee ee eel Sachse he Sl o.8| Fo 
| | 

| (a) Schaumbureg- tnppe: 4) 4) 2) 4)/—]--|]— | 14 
(b) Waldeck... Mee huer dl 8 A he | 14 
(ce) Liitbeck ee ves elle i geale cele a eee os 14 
(d) Reuss a. L. . ce soni | oil | Sl soulmate 2a Ue Pe ms 14 
_(e) Lippe PR Gy OU) BE ay a | | 14 
(f) Schwarzburg- Rudolstadt ... | — ; 1}—}; 2)/—) 56]; 8) 2) 1)—)— 14 
(g) Mecklenbure- Strelitz soe |e MBL) PE ee ea eT So eee eae 14 
| (A) Schwarzburg-Sonderhausen SN a ed Sc he 14 
Totals 112 


The resulting binomials are : 

(a) 14( 9714 + 0286)", 
(b) 14( ‘8571 +°1429)9%, 
(c) 14( 5819 +4181), 
(d) 14 (1:0058 — -0058)-##24, 
(e) 14(1°3929 — -3929)-77, 
(f) 14( 6071 + :3929)3%, 
(g) 14(1°5792 — 5792)-91"7, 
(h) 14 (16609 — -6609)-3, 


Thus it will be seen that of the eight binomials only four have a positive q, 
and of these only one can be said to have a very smajJl g, and even in this case the 
n is not indefinitely large. Of the four negative binomials three have quite 
substantial q’s, and the fourth with its small negative qg corresponds most closely 
to the Poisson-Exponential. The probable error of g for g=0 is +:2549. The 
number, 14, of cases taken is therefore wholly inadequate to test whether the 
Poisson-Exponential may be applied to these data. The mean value of q is 
negative and = — ‘0820 + ‘0901, and the standard deviation of g=:3928 + -0637, 
which are within the limits of random sampling of g =0 with a standard deviation 
of 3779. We shall return to a different manner of considering the point later. 
At present we wish only to indicate that the hypothesis is that q is a very small 
positive quantity and that data which give ga standard deviation of ‘3928, or in 
the next example of 4714 are really inadequate to test such a hypothesis ; for in 
the resulting binomials g may easily lie anywhere between +°8 and —°8, and it 
is not possible to demonstrate that its real value is practically an exceeding small 
positive quantity. 

7—2 


52 On the Poisson Law of Small Numbers 


III. Accidental Deaths in 11 Trade Societies. Bortkewitsch provides data 
from which the following table is deduced: 


TABLE VII. 


Accidental Deaths 


Index Number 


of Society Totals 


1 ] | ; 

0 BNO NG We | eae | 10 al 
13 S Syl 9 
1h ; 2 3 9 
12 fq 3 9 
20 =a a 9 
23 ei 9 
QF AG pas 9 
29 Ne ie 9 
Al Te hee 9 
40 1 2 9 
42 }— | — 9 
55 — 2 9 

Totals ... | 16 | 7 


The resulting binomials are: 
(18) 9( 4914+ -5086)5°8, 
(14) 9( 61844 -3816)"%, 
(12) 9:(1:9227 = 9227), 27, 
(20) 9 (11282 — +1282)-s2"e00, 
(23) “9° 9921 2.0079) eenes 
(27) 9( 52294-4771), 
(29) 9 (14130 — -4130)72™, 
(41) 9( 8454 + +1546)9°66, 
(40)° 9 (2:0342 — 1:0842)-27™4, 
(42) 9( 9822+ -0678)72”, 
(55) 9( 6154+ °3846)n2, 

Of these eleven binomials seven have a positive g; only one of these (23) 
actually corresponds to a really small q and large n, although a second, (42), 
approximates to this condition. In the five other cases the q’s are quite sub- 
stantial; in (13) the q is larger than p. Of the four negative q’s none can be said 
to be so small and the » so large as to suggest that they really correspond to the 
Poisson-Exponential. The probable error of q for q=0 is, however, + ‘3180, and 
thus for such small series, no test whatever can be really reached of the legitimacy of 
applying the Poisson-Exponential to such data. We may note, indeed, that seven 
of the eleven values of g exceed the probable error and two of these are more than 


three times the probable error. We should only expect two negative values of ¢ 
as great or greater than ‘9227 in 80 trials, whereas two have occurred in 9 trials, 


Lucy WHITAKER 53 


so that the odds are considerably against such an experience. 
g is — 0469 +0959 and the standard deviation of g is ‘5127 + ‘0678, both results 
compatible with g indefinitely small and a standard deviation = “4714. The main 
problem, however, of the legitimacy of applying the Poisson-Exponential to such 
series cannot be answered by data involving only total frequencies of 9 to 14 
cases in the individual series. 


He clubs the 
results given for each application of the Poisson-Exponential together and 
examines the observed totals against the sums of the calculated totals. Thus 
calculating the 11 Poisson-Exponential series* and adding them together 
Bortkewitsch finds for observed and calculated deaths: 


TABLE VIII. 
Accidental Deaths in 11 Trade-Societies. 


Bortkewitsch examines the matter from another standpoint. 


The mean value of 


Number of Deaths 


Observed Frequencies 
Sums of 11 Exponentials 


Single Binomial .. | 3°8 | 95 


0 


10 | 11 | 22 


oe O) Te 13 4: 16 7 
3-7 15°2 | 14-3 | 12°3 | 9°8 


| 


20|12)o7| 0-7 


13°9 | 15°6 


14-8 | 124 9°6 


138 & over} Totals 


If we attempt to fit a single binomial to the observed line of totals, we obtain: 
m= 43636, o2=7'°5849 
leading to the negative binomial : 
O97 382 — 7802). 7s, 
g=— 1382, + 18297,- n=— 59111 + 1391, 


or the constants are significantly substantial with regard to their probable errors. 
The resulting frequencies are given in the last line of the table above. The reader 


Here: 


* The values of the means and standard deviations for the eleven societies are : 


m | o m o m o 
13 7889 1:969 23 6°222 2-485 || 40 2°889 2°424 
14 2-556 1:343 27 1'889 0-994 || 42 | 4-556 2-061 
12 2556 | 2217 29 5889 2-885 || 55 4°333 1°633 
20 4:333 | 2-211 41 5111 2079 || | 


All these means are less than 10, which is the limit reached by Bortkewitsch’s Tables for the Poisson- 
Exponential. Bortkewitsch says he has taken the societies for which ‘the statistics indicated the 
smallest numbers of such accidents.’’ This is not very clear. It is certain that a society with a mean 
number of accidents =100, if it consisted of 200,000 members, would be more suitable for application 
of the exponential, than one with a mean of 8 if it only contained 10,000 members. Both Bortkewitsch 
and Mortara confine their results to means less than 10, and seem to indicate that ‘‘ smallness” has 
been determined by the absolute frequencies, but clearly it is relative frequency with which we have to 
deal. The use of such a term as Das Gesetz der kleinen Zahlen for the Poisson-Exponential seems open 
to serious objection, if it be associated with ‘‘m” an absolutely small number, and not with smallness 
of ‘¢q.” 

+ For q=0, the probable error would be +°0959 and accordingly q is very divergent from the 
Poisson-Exponential value of zero. 


54 On the Poisson Law of Small Numbers 


will be surprised to see how closely the single negative binomial determined by 
two constants gives the same result as the sum of the eleven Poisson-Exponentials 
determined by eleven constants, no one of which is really of any significance for its 
own exponential*. If we apply the condition for “goodness of fit,” »?= 5°83 for 
the single binomial and y?= 5°88 for the sum of the eleven Poisson exponentials, 
leading to P='950 and P= ‘951 respectively, or the fit with a-single negative 
binomial is slightly better than that with eleven exponentials. The two constants 
are significant, the eleven constants have no real significance for their individual 
series, as is demonstrated by the fact that the binomials for these series do not 
approximate to the Poisson-Exponential type. 


We may now consider the previous case of suicides of women from the same 
standpointt. The following are the data as given by Bortkewitsch : 


TABLE IX. 
Suicides of Women in Hight German States. 


| Number of Suicides 0 | 1 | 2 | Beall oA | 8 | 9 | 10 & over Totals 


Or 
fo) 
Se 


| Observed Frequencies 9? S198 O05 115 Uae ee eee ate 3 112 

Sum of 8 Exponentials | 8°0 | 16°9 | 20°3) 18°7 | 15-1 | 11°4 | 8°3 | 5°6 | 3°6 | 21 | 2°0 112 

| | 

| 

Single Binomial ... | 12°6 | 18:4 | 18°8 | 16°4 | 13-2 | 9°9 | 7:2 | 5-1 | 35 | 24 | 4°5 112 
i | 


For the single binomial we have : 
m = 3'°4732, o7? =8:2312, 
leading to: 112 (2:3699 — 1°3699)- 25354, 
where q=— 13699 +°1490, n= — 25354 + 8076. 


If q were very small its probable error would be +0901. The values of g and n 
are quite significant, g is large and negative and n is small and negative. The 
resulting frequencies are given in the last line of the table as “Single Binomial.” 
Turning now to the test of “goodness of fit,” we have for the sum of the 8 ex- 
ponentials y?= 7:957, and for the single binomial y?= 7°740, leading to P= 633 

* If the reader will turn to the first footnote on p. 53 he will note that for nine cases, the standard 
deviations of the means (o//9) are roughly about -7 or errors of +1 to +1:5 may easily occur in the 
means. Hence with the possible exception of (13) and (27) the m’s have not significant differences, and 
are not typical of the individual societies. 

+ The values of the means and standard deviations are: : 


| m o | m o | 
Schaumburg-Lippe | 1:°429 1-178 Lippe Are ag on 2°857 1-995 
Waldeck eligi) poco 1:378 | Schwarzburg-Rudolstadt ... 5143 1:767 
Liibeck ... ais 2-571 1:223 | Mecklenburg-Strelitz or 5°286 2°889 
Reuss a. L. we =| 2648 1:631 Schwarzburg-Sonderhausen | 5°642 3-061 


The standard deviation of the mean is here o/V14, or, say, 5. Thus errors of 1 might easily occur 
in the values of m. There are probably significant differences between the first five and the last three 
states, but not between the first five among themselves or the last three among themselves. Thus the 
Poisson-Exponentials, if correct in theory, are not significant for the individual states, 


Lucy WHITAKER 55 


and ‘654 respectively. Thus again the single binomial with only two constants 
give a fit slightly better, than the sum of eight exponentials with eight constants. 

Bortkewitsch looking at the observed frequencies and the sum of 8 or 11 
exponentials—without using any satisfactory test for “goodness of fit ”—assumes 
that the coincidence is so good as to justify his hypothesis. But a better fit can 
be obtained with two instead of 8 or 11 constants by simply using a negative 
binomial. We must note here that Bortkewitsch is using the final coincidence 
merely as justification of the Poisson-Exponential; the total frequency is not 
describable in terms of the 8 or 11 constants as it is in terms of the two, for 
these eight constants are not really significant for his individual eleven trade 
societies or for the suicides in the individual eight states. If he wants to describe 
the total, he has no constants by which he can do it. If, on the other hand, he 
wishes to describe what has occurred in the individual societies or states, we have 
seen that their binomials differ very widely from Poisson-Exponentials. If, lastly, 
no stress be laid on the individual cases as having too large probable errors, but 
only on the general coincidence with total frequencies, then the same coincidence 
would justify us in using a single binomial with two constants only*. It appears 
to us that to properly test the Poisson-Exponential, we need not 9 or 14 instances 
in the individual case, but several hundred instances,—more, indeed, than “Student” 
has taken—and that no proof of the “Law of Small Numbers” can be obtained 
on data such as those of Bortkewitsch or Mortara. 


IV. Deaths from the Kick of a Horse in Prussian Army Corps, omitting four 
Corps with Bortkewitsch. 


Here the results are: 
TABLE X., 


| Number of Deaths ... 0 1 Z 3 | 4 Totals 


Number of Corps —... 109 65 22 3 1 | 200 


Whence : 
m='61, p.='6079 
and the binomial is: 
200 (996,557 + 003,443 771707, 
This is the first of Bortkewitsch’s illustrations for which his hypothesis that q is 
small and n large is really justified by his data. For: 
q = 0034 + 0670, 
n= 1771711 + 3449:108. 
The probable error of g for q really zero is + ‘0674. 
* Of course immensely better general total fits are obtained by using the sums of the actual 8 or 11 


binomials than by the Poisson-Exponential sum or the single binomial, but the results in that case 
involve 16 or 22 non-significant constants. 


56 On the Poisson Law of Small Numbers 


The actual results as given by the binomial and the Poisson-Exponential are: 


TABLE XI. 


| i 
Number of Deaths ... | 0 eae ae 2 3 4, and over | 
= we |I— | | | | 
Observed Hae as 109° =4)), Ob" 22 3 1 
Binomial a ax 108°6 | 66°4 | 20°2 Ail 0-7 
Exponential ... “3 108°7 | 66°3 20°22 | 4:1 O'7 | 


Actually if we work to two decimal places in the frequencies we have y? = ‘61 
for both binomial and exponential, or the goodness of fit is practically identical. 


In this case it seemed worth discussing the binomial fit more at length. 
Taking the moment coefficients about the mean we have: 


(1) Mean =ng="6100. 


(11) bs = npg = 6079. 
(111) Hs = npg (p — g) = 590,562. 
(iv) fy = npg (1 + 8npgq — Opq) = 1:°643,373. 


We have already discussed the binomial from (i) and (ii), giving x” for goodness 
of fit ="6096. Using (11) and (111) we have for the binomial 
200 (985,739 + 014,261), 
giving x? = 665. 
Using (111) and (iv) we have: 


200 (979,524 + °020,057 303, 


giving x? = 707. 
Putting : B.= Me | pe? and Bi = M3/ ps?, 
we have: B,—-3 =(1—6pq)/npq, Bi. = —4pq)/npy, 


and working from 8, and £, we find: 
200 (969,150 + :030,850)89™, 
and in this case x? = 1:1286. 


This of course does not give a bad fit, but it is clear that working from the 
lowest moment coefficients, as we might anticipate, gives the best results. 


But if q be the chance of death from the kick of a horse, and n the number of 
men in an army corps, then the binomial should be 


200 (p +q)” 


Now it is obvious that none of the binomials give, by their value of n any 
approach to the real number of men in an army corps. If we start with the 


Lucy WHITAKER 57 


number of men 7 in an army corps as 50,000*, we have ng ="61 and g=:000,0122, 
thus reaching the binomial 
200 (-999,9878 + 0000122), 


giving as compared against Bortkewitsch : 


Binomial Bortkewitsch 
0 108°6876 108°6703 
1 66°3002 66°2889 
2 20°2213 20°2181 
3 41115 41110 
4 and over °*7035 ‘7034 
and y? = 608,298 608,318 


or, the slight advantage to the binomial exists but is of no significance. 


Now it seems to us that in this case the use of the exponential is justified for 
the total frequencies, but as far as describing those frequencies is concerned, it 
gives no better result than the binomial. But as in the other five of Bortke- 
witsch’s cases the Exponential is not justified by the individual series themselves. 


It is perfectly true that the exponential has a definite theory behind it, and 
is interpretable in terms of that theory, i.e. we must suppose the probability of an 
occurrence very small and the chance of its repetition absolutely identical. But 
is the second of these conditions ever likely to be demonstrable a priori, or must 


* This supposes that every man in the army corps is equally liable to death from the kick of 


a horse; of course a very arbitrary assumption. 
+ To illustrate the idleness of the application of the Poisson-Exponential even to these data for the 


Prussian Army Corps, we give here the binomials for the whole of the 14 corps. 


Index Number 
of Corps Binomial 


G 20 (-95 + -05)16-0000 
: 20 (1325 — +325)-2-4615 
ul 20 (1:5667 — 5667) —1-0585 
Tl 20 (-9 + +1)6-0000 
IV 20 (-6 + -4)1-0000 
W 20 (-6318 + +3682) 1-4938 
Vi 20 (1:0912 — :0912)—9-3202 
VII 20 (-9 + +1)6-0000 
VI 20 (-65 + +35) 1-000 
Ix 20 (‘8115 + *1885)3-4483 
x 20 (1°05 — -05)-15-0000 
XI 20 (1-11 — -11)-11-3036 
XIV 20 (1:05 — -05)—24-0000 
XV 20 (1:1 — *1)—4-0000 


One seeks in vain through these binomials for any approach to q very small and positive and very 
large and positive. In no case does n approach the number of men in an army corps, say 50,000, 
or q equal the chance of a death from the kick of a horse, say, ‘0000122! It seems impossible by 
clubbing such equations together to give any satisfactory proof that the Poisson-Exponential really does 
apply to individual cases. In the 20 years involved, there were doubtless great changes in both 
the training and the personnel of each army corps, and the results obtained may be just as much due to 
such causes as to the errors of small samples. 


Biometrika x 


58 On the Poisson Law of Small Numbers 


not we a posteriori demonstrate it from the data themselves? Child suicide may 
be influenced by example, by environmental conditions in different districts, 
possibly even by meteorological conditions in different years. Again, even in 
different army corps the conditions may be far from uniform, the spirit of the 
corps, the teaching with regard to the handling of horses, the experience of past 
life according to whether the corps is raised in town or rural districts may all tell. 
Even Bortkewitsch before he gets his best fit removes four corps or 80 observations 
from his data. We do not criticise this removal, but even unremoved he says the 
fit of theory with experience leaves “wie man sieht, nichts zu wiinschen iibrig” 
(p. 25). But the binomial is before removal: 
280 (1:085,714 — 085,714)-8 155 

in which q is not very small and is negative, and n is not very large and is not 
positive. It is true that the probable error of g for q insignificant is in this case 
+0570, but this only shows that the data were insufficient in quantity to 
determine whether the exponential could be applied or not. 


(9) Mortara’s Cases. 

Mortara* in an interesting paper has realised the possibility of repetitions not 
being independent and has discussed a constant @’, by which he proposes to test 
such influence. This quantity Q should be unity, if the Bortkewitschian hypo- 
thesis can be applied. He then takes 16 or 17 districts with records of 10 years, 
and calculates the mean number of deaths from some special cause per year, say, 
for each district for those years. If this mean number exceeds 10, he casts out 
that district, presumably on the ground either (1) that such a number is no 
longer small, or (ii) that it differentiates the district from those with lower 
numbers. Thus Bologna with 10°9 deaths by murder is excluded and Bergamo 
with 84 is included, although Q’=1 for both. Bologna with 7:1 deaths from 
smallpox is included, but Pavia with 12°3 is excluded although the Q’ of the 
former is 2°5 and that of the latter 1:7. What method should be employed in 
dealing with the frequency of the excluded districts which may amount to 50 °/, 
of all districts is not discussed. Having thus reduced his available districts, 
Mortara proceeds to apply the exponential to each individual district ; he adds up 
the results for each district and compares his totals with the observed totals. It 
will thus be observed that he fits his exponential to ten observations, and then adds 
together five or more districts to get his totals. We can equally well apply this 
process by fitting a binomial to each 10 observations and then adding up such 
results. But it is quite clear that on the basis of ten observations, it is, owing to the 
large probable errors, wholly impossible to assert, whether a binomial of the kind 
required by the Bortkewitsch-Mortara hypothesis,—i.e. one of very small positive q 
and very large positive n—really is justified. We can illustrate this at once from 
Mortara’s Tables (see his pp. 42 and 45) for deaths from Chronic Alcoholism. The 


* « Sulle variazioni di frequenza di aleuni fenomeni demografici rari,” Annali di Statistica, Serie v. 
Vol. 1v. pp. 5—81. Roma, 1912. 


Lucy WHITAKER 59 


observed numbers, and those deduced from the binomials are given in the 
accompanying table. At the foot are the observed totals, Mortara’s exponential 
totals and the binomial totals. 


TABLE XII. Deaths from Chronic Alcoholism. 


Oe Cason aber) |) ling | 9 | io | ar | a2 las |e | 
| | | | 
| | | i. i st | | 
Calabria 1 ® 4 — 2 = |) = | | — | — | Observed 
1:49] 2°84) 2°70) 1°71 81]; -31}) -10] -03)| -01 = | —- | Mortara 
118} 2°85} 3:06} 1:91 "76 | °20} -03 | - | = | | Binomial 
Foggia 1 2 4 — | 2 oS || | | | O. 
1:00} 2°30] 2°65] 2:03) 1:17 54 | “21 07 | °02] -O1 | M. 
96) 2°29) 2°70 | 20S eelaliSae <b3 19-06 Ol; — | B. 
Siracusa 2 1 3 — 2 2) | - —;/;—|— | 0. 
*82| 2°05] 2°56] 2°14 34 67 | 23) “10 | -03)| “Ol — — |/—}]}—|— |M. 
112) 2:16). 2°33} 1°85] 1:21] ‘69 S4ay | kr | S07) 03 Ol B 
| 
Potenza 2 — 2 2 1 1 ea ee | = /O 
4 1°30) 2°09) 2°23) 1:78) 1:14 Gil 288 O04 ol; -- |—/;—}|]— |M 
78| 1:6 1:95} 1°80] 1°41] -98 63} °38 21 12} ‘06; °03; -01} 01] — | B 
Catanzaro | 1 1 Smee eee = 1 1 1 — | — Re pel |= Peony 6) 
15 s63= l32.|- 1-85)| 1:95.| 1:63) 1-14) 369 | +36 17 ‘OT (0953 |) COE |) |) Se 1 AE 
Solmelecon LAG (0 1363) Peli | 95 75] 57 | +43 31 23) °16/ 12) °08 7/B 
Salerno 1 1 il _— 2 — il 1 2 — | — ae Oz 
06 31 e19ee eso) 72") 75a 1491-09) -69)) <39 20 9} 04] °0 “Ol | M. 
40 86) 1:18} 1°31} 1°27)1°14] °95 lide | s09 45 33 | °24 | ‘17 22 1B 
| | 
Cosenza 2 = I = l Sy 8) 1 a el =| 1}/—|— |0O. | 
06 29 foe 29) |e e1e68i\ dere Io 112 73| °42] °22 0) | 05.) -02)| 01) M 
43 88} 1 ele 7d D30) TiO 93 75 59 | °44 33 24/)°17|°13| °33)B. 
Bologna = | = 3 ea al i || ea ae een eee 1 S| 0) 
‘Ol 06 21 49) +88] 1:24] 1°47] 1°49 | 1°32 | 1:04 74| °48) °28/°15|) °14) M. 
‘40 46 97) 1:06] 1:05] -98 87 76) ‘64 | 53 43°35 8] “21 7 133 
: | | | if | 
| | 
Totals 10 8 Qi alin 4: WM BY i | 8} il il 2 1 | — 2 | O} 
. 4:00 | 9°78 | 13:07 | 13-09 | 11-33 | 9:03 | 6-81 | 4°87 | 3:27 | 2°08 | 1:24} -70| -38|/-19] -16]M. 
6°15 | 12°75 | 14°82 | 12°64 | 9°28 | 6-57 | 4°70 | 3°46 | 2°54 | 1-88 | 1:38 | 1°02 | °75 | 55 | 1°43 | B. 
| | aera bao Z = 


The following are the binomials for the 8 districts out of 16 which Mortara 


has selected. 
Reggio Calabria 10 ( *7842 + -+2158)ts8so9 


Foggia 10 ( 9609 + -0391)87" 
Siracusa 10 (1°3000 — °3000)-3# 
Potenza 10 (15500 — °5500)3!2 
Catanzaro 10 (2°7524 — 1°7524) 229% 
Salerno 10 (2:3510 — 1:3510)-27 
Cosenza 10 (25308 — 1:5308)-*# 
Bologna 10 (33161 — 23161)" 


60 * On the Poisson Law of Small Numbers 


Examining these we see that there are only two in which g and n are positive 
and only one in q is small and positive and n moderately large. The probable 
error of g for 10 observations on the assumption that n is very large and q very 
small is + ‘3016 and is quite inconsistent with the last four districts being samples 
from exponentially distributed frequencies. The other four districts may or may 
not belong to such frequencies—the data are wholly madequate to determine 
whether they do or not. Reggio Calabria and Foggia have the lowest Q's, 
ie. 09 and 1:0. But that six districts out of an already selected eight give 
negative q and a seventh a relative large g and small n suggests the inapplicability 
of the hypothesis adopted. If we seek for “ goodness of fit” of the totals, we find: 


Binomial Exponential 
y? = 2512 47-92 
P0336 ‘0000 


Thus the odds against the binomial system are 28 to 1, but the odds against 
the exponential are enormous. It does not seem possible to justify the treatment 
of such data by the use of the Poisson-Exponential. 


Let us turn to a second of Mortara’s illustrations, that of deaths from small- 
pox. He rejects first six out of the 17 districts, the remaining ten are given in 
Table XIII. The districts give the following binomials: 


Venezia 10( 9500+ 0500) 

Bologna 10( 9889+ 0111)" 

Treviso 10 ( 2:2000 — 1:2000)-*8 
Pavia 10( 1:8000 — 8000) 
Caghari 10( 45190 — 3:5190)-> 
Padova 10 ( 36833 — 2°6833)-%# 
Verona 10( 56000 — 4:6000)-™" 
Brescia 10 (O97 20 — B8i0i 2h) 
Bergamo 10 ( 23821 — 1:3821)-7=9 
Catanzaro 10 (156128 — 14-6128)~ 76 
Vicenza 10 ( 34854 —  2°4854)-Ve97 


Out of the eleven cases only two give g small and positive; not a single one 
gives for g anything like the chance of a death from small-pox in the district, nor 
for n anything like the population of the district. There is an increasing divergence 
from the positive binomial as Mortara’s Q’ increases in value. We see that in nine 
cases, however, a negative binomial not the exponential is required to describe the 
frequencies. The probable error of gq, for insignificant q is as before + 3016, and 
therefore it is improbable that g is zero in at least 9 out of these 11 districts. 


Examining the totals we find 
Binomial Exponential 
v2 = 9°64 * 570°79 
Oi ‘000,000 


Lucy WHITAKER 61 
TABLE XIII. 
Deaths from Small-powx (1900—1909). 
: fy | 12 or 
0 1 2 8 4 5 6 : g er ea lee | more 
Venezia 4 5 = 1 7 || le Observed | 
4:49 | 3°60] 1°44 38 08 01) -- — = — —|— — | Mortara 
4°40} 3°71} 1°46 36) 06) ‘01 -—— | Binomial 
Bologna 4 4 1 1 — —|/|— —}|} — |90. 
4:07 | 3°66} 1°65| 49 ear Nfs ee: ©) een |r| ean cn | i ee IVT 
4:04} 3°68} 1°65 “49 ‘ll 02 Ol; — |) — B. 
Treviso 5 3 7 pe == 1 | | 0! 
3°68 | 3°68} 1°84 61 15| :03 ‘OL| - - M. 
Halls 236) 1-18 | ‘61 By ON ‘09 ‘05 03) ‘Ol; — | — = B. 
Pavia 4 3 el eee fee lL catty tea 0. 
SO) 382624) 2:17 87 26 ‘06 Ol = | = M. 
4:14] 2°76 e538) 79 40 19 09 O05 02 01} ‘01 |) — | = B. 
Cagliari 5 1 1 1 SV es eel aes peo ee eee | a ea ee eS 
OS Ocou eouiOln woo “99 42 IL) 04 OL y= —}— = M. 
4:07] 1°89} 1°17 79| 55! -39| -28| -21| -15| -11/:08|-°06/ -25 |B. 
Padova 3 3 _ 2 = 1 a |e == = 1/— = 7 |KO): 
‘91 | 2°18) 2°61} 2°09] 1°25 ‘60 24 08 03 Ol}; — | — — M. 
o2'| 2°03) 1°40 98 MON 250 36 26 19 13°] “10 7 16 | B. 
Verona 4 3 — 1 = = 1 ome ee oe |e | i O. 
OM Ql Sei :6la|) 22098) 125 ‘60 | +24) -08 03 ol) — J] — — M. 
4:07 | 1°74] 1:09 ‘75 4 40 | °31 733 18 14 | -11 | ‘09 Fata) || 15%, 
Brescia 2 3 2 2) = == = = a ee | 13 0) 
ES alan le QO 2 Oil) || a2, 1°82 | 1°20] ‘66 31 33 05 | 02 | — — M. 
4°99) 1°42 87 6 47 coll 30 24 20 17 | *14 | °12 79 |B. 
Bergamo 2 — 2 2 — | 1 — 1 1 1 }/—/—] — |9. 
*20 79)| 1°54) 2:00)) 1°95 1°52 99 5)5) 27 12 | 04! -02 ‘Ol | M. 
Hotere dol 1:57) 1:46) 1723 98 74 54 38 he | Altsy | aly 24 |B. 
| 
Catanzaro 3 3 1 1 1 |g en |e | 1* | 0. 
*20 “ON 1254.1 2:00)) 5951) 1:52 "99 YD) ||) 47/ 12 | :04 | 02 ‘O1 M. 
4°80 |} 1:20 el 50 38 ill 25 21 18 LG 4 | el2s 04s 1B: 
Vicenza 3 = 1 1 1 1 = 1 1 = — | — 1 O. 
‘17 GSae eso alee 1:95 | 1°60 | 1:09 BS} 15 | 706 | °02) -O1 M. 
Qe SOs da? 1eQBnle LeODie 282, 65 51 39 30 | SBA PI | °48 B. 
lias ae | 
Totals Somos, ie i.| 2 2 CRden No Wo sor ery) wae. Os 
19°24 | 24°97 | 21°50 | 16°54 | 11°76 | 7°58 | 4°38 | 2:25 | 1°07 ‘46 | °16 | -06 03. «| M. 
40°25 | 23°70 | 14:05 | 8°58] 5°78 | 4°16 | 3:08 | 2 L254 ed 59 Olles7ios eo soln b 
| 


Brescia, and 1 at 27 in case of Catanzaro, if the means were to agree with those given by Mortara. 


* 1 at ‘12 or more’ in cases of Brescia and Catanzaro was found to signify 1 at 20 in the case of 


62 On the Poisson Law of Small Numbers 


In other words the binomials give a reasonable total fit, the exponentials a 
practically impossible one. 


But there is another question to be asked in such series as those of Mortara: 
What justification is there in cutting off at 10 cases, say of murder? <A province 
may have a million inhabitants and, perhaps, 40 murders occur in a year*. Hence 
the binomial is for ten year returns 


10 24,999 il 1,000,000 
x eecce a saat) 


but this is as close as anything can be desired to the exponential series. It may 
be reasonable to apply a separate series to districts giving 4°2 and 36°6 murders 
per annum respectively, but it is difficult to see why the latter district should be 
altogether excluded from treatment. If the theory of the binomial be applicable 
at all, then it applies practically as well to districts with 40 murders as to districts 
with 4; for, we need no indefinitely small g to get a closely exponential series. 
If we take the case of deaths by murder, Mortara has retained only 6 out of 16 
provinces, yet his criterion @’ (see his Table, p. 51) is not more divergent from 
unity for the rejected provinces than for those retained ; the binomials are indeed 


Reggio Treviso 10( ‘7000 +3000)" 
Venezia 10( °5619 + °4381)s" 
Vicenza 10( 9571 + 0429)842! 
Padova 10 ( 4774 + 5226)» 
Pavia 10 (1'8162 — -8162) °° 
Bergamo 10( ‘8857 +1143) 


only one of which gives qg small and positive and n large. 


The mean Q for the retained provinces is ‘967 with a range from ‘7 to 14 and 
for the rejected 1:03 with a range from ‘8 to 1:4. Even if—which is not the case 
—the probability of an individual being murdered were too great for the ex- 
ponential, it ought to follow the binomial, but this, as a rule, it does not do, unless 
we give some wholly new interpretations to g and 7; the actual values render the 
theory of the binomial as stated inapplicable. 


(10) Mortara’s Criterion. 
As a matter of fact the only test of whether an exponential will legitimately 


fit a given series or not is to determine the binomial (p+q)”" and ascertain 
whether p is slightly less than unity. But: 


p = npgq/ng 
_ (Standard Deviation) 
7 Mean 


* We assume that each individual is equally likely to be murdered. But if there be a graduated 
probability for murder throughout the community, what right have we to apply Poisson’s series at all? 
The essential basis of the application—equal chance of each individual—is wanting. 


Lucy WHITAKER 63 


Now if m, be the number of deaths, say, occurring in any year and there be 
1 years under consideration, then: 
S? (ms — nq 
l : > 


(Standard Deviation)? = 
or, if we use the form preferred by Bortkewitsch* 


8S? (ms — nq)? 
eet ee 
S? (ms, — nq) 

(l—1)nq © 

This in other notation is Mortara’s Q”, the only criterion he’ actually uses 

provided by his equation (17 ter), p. 18. Thus his Q’, which he says must not 
differ much from 1, is only /p, and it would be better to use p—which has a 
direct physical meaning—than Mortara’s Q =,/p. Clearly Mortara’s somewhat 
elaborate process of deducing Q’, does not amount to more than saying: Fit a point 
binomial and test if p is slightly less than unity. We contend that it is best 
straight otf to fit the binomial. 


Hence : p= 


It is true that Mortara does not reach his Q”, our p, by the simple process of 
asking whether the binomial is one with a positive probability less than unity. 
He endeavours to obtain it by considering whether there is “lumpiness” in the 
observations. But it seems to us clearer and briefer to ask: Are the contributory 
cause-groups independent as in teetotum spinning? If so, the data will fit a true 
binomial and p will of necessity be a positive quantity less than unity. If they 
are not of this character then p must of necessity be greater than unity. It is of 
interest to see how Mortara’s test of dependence of contributory cause groups 


2 


leads to a criterion, but he actually only gets his Q”, Le. our binomial p after 


2 


a series of hypotheses which much limit, and that in no very obvious manner, 


* The use of NE or en in the value of the standard deviation when l is small has been several 
times discussed. It may be dealt with as follows: The probable errors of a mean as deduced by the 
two processes are 

E=-67449 . o/a/1, 


and E’=-67449 .o/,/t-1, 


now B= 61449 6] JT(1 +5) +-~- ] 


1 
= -67449 +. (« eee ot. ) 
wl Not OE 
1 
may and —— is less and often much less than ‘67449. 
wel /21 
Hence if we only know o from the observations themselves, and this is the usual case, we have: 
1 / 
2s gh 
Jt 
where o’ differ from o by a quantity usually far less than the probable error of ¢. In other words the 
refinement of using H’ for F is idle having regard to the accuracy of our observations; and the form 
used by Bortkewitsch and Mortara with ,/1—1 for nibs of no importance. 


Now the probable error of o is °67449 


E’ = -67449 


64 On the Poisson Law of Small Numbers 


the nature of those contributory causes groups. Of course if their dependence 
were of the nature of successive draws from a pack, then the result would be 
a hypergeometrical series and Q? would have no physical meaning for the series 
at all. 

(11) We will deal with one further illustration out of many considered by 
Mortara which are of like character. In the case of Marriages of Uncle and Niece 
(see Table XIV, p. 65), where the distribution of Q’s is the most favourable 
for his theory, the binomials are 


Reggio Marche 10( "7000 + -3000)'° 


Umbria 10( :9000 + -+1000)°° 
Basilicata 10 (14000 — -4000)7* 
Sardegna 10( °44545 + °55455)198% 
Emilia 10( 9818 + -0182)2020 
Abruzzi 10( 8429 + °1571)788 
Lazio 10 (12548 — -2548)—12 1646 
Puglie 1OGeS =o ace 
Veneto 10 (1:34.44 — +34.44.)~1'S064 
Toscana 10:(2:2667 A266 7) 28" 
Calabria 10 (13584 — -8584)—2#°88 


of which only one (Emilia) approaches the conditions for an exponential distribu- 
tion. If we test the totals at the foot of Table XIV, we find the result much to the 
advantage of the binomial, for which P = ‘902 as against ‘714 for the exponential. 


(12) On Mortara’s own showing nearly all the Qs of his numerous series are 
greater than unity, and very few of the binomials are positive. If we consider the 
distribution of Q's, given in his work omitting Table 13 (Deaths from Malaria) we 
find a range from ‘5 to 3°6 with a mean Q at 

1:2565 + 0847, 
while for the distribution of all the p’s in the binomials we have determined, we 
find a range from ‘4 to 15°6 with a mean p at 2°5655 + ‘3817. 


These results are sufficient to show that there is no real distribution of p round 
the value unity but the binomials have a distinct tendency to be negative. 


(13) But the whole theory of Poisson’s exponential law in the hands of Bortke- 
witsch and Mortara appears essentially vague. The binomial is built up on the 
assumption of the repetition m times of a number of independent events, of which 
the chance of occurrence is identical and equal to g. The population is n and the 
chance of occurrence q in the case of each individual. The mean frequency of 
occurrence is ng. But if g be very small we have seen that the series 1s 


—m [1 , m me 
e ppnow nt soy v8 ; 


i —— 


Lucy WHITAKER 65 


TABLE XIV. 
Marriages of Uncle and Niece (1900—1909). 


: as 
0 1 ze | 8 pe Nae |e oer |e om ip it | 22 | 23 | 14 15 | 18% 
| | over 
er 
Marche O.| 7 3 = = | | 
M.| 7-41] 2-22] -33] -04 | | | | 
Bale iz 2 | 
Umbria 0. 6 3 1 — — | | 
M 6:06 | 3°03 s(oa wld 02 
B. | 5°90} 3:28 ‘73 08 | -— 
BasiicamOnime. | o3 | — | 1 | — | ; | | | | | 
M 5°49 | 3:29 99 ‘20 03 | | 
B. | 6°04] 2°59 92 31 10 03 Ol. | 
Sardegna O. 2 5 3 = - 
M.| 3°33] 3°66) 2°01 74 20) °05 O1 
Pelpeccores-o6 | 3:03) = | — | — | — 
| | 
Emilia =O. 1 3 2 Zee 1 — 
M Il-l1l| 2°44) 2°68} 1:97] 1°08} °48]) ‘18 05) -Ol 
B 1:09] 2°48} 2°70] 1:98 | 1:08 47| ‘17 05 Ol 
Abruzzi ©, || — SUF whee ail 3 | 2 _- 1 — | — | 
M 61) 1°70) 2°38) 2:23] 1°56 8 Se G 06 02 
B Se elon mcd S|) 2°43 | 1°68 87 34} *11 03) — 
isazion ©. | 1 eal a2 3) — | 2}/—) 1] —) — 
M ADM eA One Dele 224 173!) 1:07 5D 25; 10} ‘03 OL 
B 63} 1°56! 2°09} 2:00} 1°54) 1°01 59 31 14} -06 03 O01} ‘O01 
Puglie O.| — 30 al 2 nh peal 1 72 ae bbe |e fee ae | 
M 27 FOS) wake) Loi U38 | “835 427) =19) 08 03 01 | — | 
B Sone lesOny We77 | 1-80 | 1538 |1:14| -77| °49) 29] +16) +10) :05] :03 | ‘Ol | ‘OL 
| 
Veneto O. 1 = 1 l 3 1 i a 2 | | 
M. 11 SHOP aoa OOM le OO! Ts fn) 28 82) “46 23 10 OA 025-01) 
Bs 21 ‘70 | 1°26] 1°62] 1°67 46 | 1°13 TS Mh > aap 30) 17 09 | 05 | °02 | -O1 | O01 
Toscana O. _ =e 1 2 2 2 1 Le — | — 1 
M. O04 24 66) 1:19) 1:60] 1°73 / 1°56) 1:20} -81 49} °26 13 | 06} ‘02 | 01 | — 
B. 31 SM lsOvaleelaton al 2ifal elie) 1-00 83 | °65 5O 37 27 | 19 | 13} 09 | 06] :10 
| 
Calabria O = - 2 2 1 1 1 ented tg | —| 1 
M — Ol 05:; “16. 36 4) 94] 1:20 | 1°33] 1:32] 1:17 | :95) -70| 48 | -31 | -18] -20 | 
B 00 O03 wi; 26 48 73 96 | 1:16 Deal OL C7 6Ou plaleon I 2a. a6 
: | ea | 
Totals O. 24 24 1} || ile 8 9 5 5 3 it 1 it = |) SE ee al 
SS M. | 24°88 | 19°47 | 14:93 | 12°72 | 10°39 | 7:93 | 5°76 | 4°10 | 2°96 | 2-17] 1°57] 1°13 | -78| -51 | -32 | -18] -20 
\ B. | 24°22 | 22°16 | 16°46 | 11°73 | 9°35 | 6°88 | 4°98 | 3°74 | 2°84 | 2°19 | 1°71 | 1:29 | -97 67 | 48 | °32] -26 
| 


} 
J 
As Biometrika x 9 


66 On the Poisson Law of Small Numbers 


from which x has disappeared, and in this exponential we have seen that 
Bortkewitsch and Mortara suppose m small, ic. 10 or under. We have seen 
that there is no reason why m should be absolutely small, and that the name 
given by Bortkewitsch to the Poisson-Exponential—ie. the “Law of Small 
Numbers ”—is misleading. But supposing the mean occurrence m to be small, 
it by no means follows that g need be small and n finite. For if g="2 and n=4, 
m would be “small ”—and the sort of small number with which our authors deal, 
but the mere fact that the mean frequency of occurrence was 2 would not justify 
our using the Poisson-Exponential for 
(Golam ey) 

The fact is that when our authors speak, of the deaths in a Prussian Army 
corps from the kick of a horse, or the suicides of schoolgirls, or the deaths from 
chronic alcoholism as being “small,” they really mean small as compared with the 
number of persons exposed to risk. They had probably in mind all the men in 
the army corps, all school-girls or all individuals liable to death in the towns 
considered. But are all men in the army corps,—or only the cavalry, the artillery, 
etc..—equally liable to death from the kick of a horse? Is every school-girl equally 
hable to commit suicide or only a very few morbid and unhealthy minded girls? 
Is every individual equally liable to die of chronic alcoholism, or only perhaps the 
10 or 12 confirmed and aged drunkards in a town? The moment we realise these 
doubts, what is the population n to be considered? It is not m being small, but 
the smallness of m/n that leads us to believe that the binomial may have passed into 
an exponential. But if only six school-girls per year in a community are in the 
least likely to commit suicide, what is the justification for the “law of small 
numbers,” if the average number of suicides be 65? Further, if we pass to even 
a large community in which the tendency to commit suicide is graded—a very 
probable state of affairs—m might be small and n large, and yet since q is not 
constant, the binomial and its exponential limit would not be applicable ; and this 
non-applicability would not depend on “lumpiness””—i.e. contagion or example in 
occurrence. Thus the probability might be: 


(Pit h) (P+ G2) (Ps + Gs) +++ (Put Yn) 
with all the p’s independent (as in spinning differently divided teetotums) and not 
correlated (as they would be in drawing successive non-returned cards from a pack). 
It would seem therefore that a priort we should not expect the conditions for the 
exponential to be fulfilled in most of the cases selected by Bortkewitsch and 
Mortara, although with perfect mixing we might expect it in the cases cited 
by “Student.” 


(14) In order to test this point on adequate numbers, the ages at death of all 
persons dying over 70 years of age were extracted for a period of three complete 
years from the notices of death in the 7imes newspaper for the years 1910—1912: 
see Table XV. These announcements of death are those of individuals in a fairly 
limited class, which may be considered stable in numbers for these three years. 


67 


Lucy WHITAKER 


| | 80- OLLI I ol 
| | | | GE. 8F-1 I c 
| | FR: LE-€ r II 
| | | E9-E 96-9 F OL 
| OF. | C6: L lI €P-t LUFL | Sl 6 
| | | LEI OL-€ € 66-81 8E-LE cE 8 
Ge. | &6- | L OF-¢ 88-6 8 FL-€F iL. 6P TF Z 
Le. 18-@ é LLL | GP-DS LE LL-G8 | 68-€8 | 18 9 
CF.9 IL-OL €1 €E-6F | G6-G¢ 19 CL-9FL LT-6é1 611 G 
GFF LG. G Le. Le L8-6E FE-FIL | &6-IIl Lo ¥G-L06 | Té€-LL1 691 ip 
[€-¢1 0-91 | LI GF-Z6 1-6 68 FO-GIZ | GF-Z61 Est SL-CEZ | OF-60E | Gz g 
1€-F6 T0-F6 F6 GL FEE Ge-L1E | gle E6- F6G | 06-69@ Le 68-006 62-006 | LOE | o 
LE-9FE 88-0GE OGE O€-L6E | OG-GLE | OLE LE-ELG | LE- PLE LOE IL-FLL €0-6€1 OFT 
69-GE9 L€-€E9 €€9 CE-9EE | 19-€9€ F9E SL-9@1 | 18-G¢L GOL If-G& | G66 OF 0 
Ox: | 8L-6 |. “1 II 
| Q8-€ 98-¢ r 01 
| 96-01 | LF-€I | 61 6G | 
PP. Ciel il G8-TG | C9-8E | ike | 8 
89-1 | 90-6 i FO-6G =| 99.e¢ | 69 | Z 
88-9 | ¥8-6 9 ST-66 FE-L6 |! 8 9 
91-1 | COsG 1 CLE | 9G.86 (ac G6 8¢cT €G-O0¢1 CFL | G 
| €E-6 | 18-6 LL 19-0L | 86-éL 6L  O8-ELE 60-661 L81 Y 
97-€ wre | & GBF | 69. EF GP SI-C9L LLGT LS F8-9G% 89-216 | ITE 8 
6F-GE G8-Ze | BE 0G-€91 F0-G91 FOL 18-682 OL- TLE 9G BL-181 O€-F81 OLI o 
00: EE OL-O&E CGE 91-966 | 8L-16E 16€ 66-8 8L-GEE 6EE 11-46 | OG-LOT | OIL ib 
QE. 868 10-668 T&8 €8-O8F | LG. P8F F8r 9@-86L | LE-BLE GEE T6- Ge FF-GE | €€ 0 
[ey | Bl | | | | | | 
ay (ae eIulouUurt DIATIS eH | yBVIUTOTT | VALS) (en | B | paaaos' Biyuauodxy ertulouUutl JATOS 
-uauodx gg | 1?! iq | I 40 -usuo0ds gy | [vl lq | P 90 | -uauodxg | (PRMOUra P qo eo | Te! lq | P qo motp aad 
== : ae : pleas eS —— —— sired 
jo Tay nN 
TIAQ) PUB SIVAK YG IIAQ purv SIvAK GY IBAQ puB SABI OF TAAQ PUB SIBOR OL 


-vadndsmau sowty, ay2 woul pabp ayy fo hivp sad syynaqr 
“AX WIAVL 


“TOTO AA 


“Ud TAL 


68 On the Poisson Law of Small Numbers 


Table XVI shows that the announcements of deaths over 70 years of age only 
amount to 3°74 per day for males and 3°52 for females. These are certainly “small 
numbers,” but “small” with regard to what? Are we to consider n as the number 
of the population which embraces, (i) all the individuals of the limited classes of 
the same range of ages as the defunct, (11) all the individuals announced as dead 
on the same day, (iii) all the individuals of whatever ages of the class which 
announces deaths in the Times? Or, should we refer to all the individuals in the 
community of that range of ages, or the whole community at large, 1e. the chance 
that in a population of so many millions an individual over 70 or 80 as the case 
may be will die and have their death announced in the Times newspaper? Well, 
it really does not matter, because if for any one or all of these populations the 


binomial (p + q)” applied, we should get if g were small and n large, the Poisson 
series 


Cm @ +m+ ae nag + 4 
Ziel ; 

and this quite regardless of the size of n. If therefore we did find a series in 
which g was very small and n large, we might not be able to say to which, if any 
of the above populations n applied. On the other hand the mere fact that m is 
small is no justification for the use of the “law of small numbers” as is sometimes 
implied. If it be argued that the small number of people who die over 80 and 
have their names recorded in the Times are drawn from a small population, we 
reply so it may be argued are the school children who commit suicide, the uncles 
who feel any inclination to marry their nieces, or the men liable to die of chronic 
alcoholism ; and we can in the case of the announcement of deaths test the values 
of g and n on fairly adequate numbers. As a matter of fact we do not know, in 
attempting to apply the Poisson formula, what is the population from which we 
are drawing our individuals, and the justification of the Poisson formula lies only 
in showing that there actually does exist a binomial for which qg is small and 
n large. We might imagine that as we got to the higher ages practically every 
person of that age would die, or that in our notation q would be 1 nearly and p be 
a very small quantity ; thus an approach might be made to the Poisson-Exponential. 
But the approach to the Poisson-Exponential arises not through q approaching 
unity but from q becoming very small. Nor again in the lower age groups do we 
find ourselves left with a positive binomial. 


In all cases except women over 90 years of age, we find that a negative 
binomial best fits the observations. Even in the case of the announcements of 
deaths of women over 90 years, we find that the approach of the binomial to the 
Poisson exponential depends on 


/ i: 53°3333 
(1 a 553555) 


being measured with sufficient approximation by e = 2°71828. But 
(1:01875)?°8 = 269323, 


Lucy WHITAKER 69 


and is therefore not a very close approximation, a result shown when we use 
a binomial by the substantial improvement in the measure P of “goodness of 
fit.” Even in this case we are not prepared to say what is the population for 
which the g = ‘01875 in the case of these announcements of deaths of women over 
90 years of age. It can scarcely be that there are only 29 women over 90 years 


TABLE XVI. 
Constants for Deaths of Aged. 
Men. 
| | | 
Probable _ Probable lee omiaal Expo- 
Age over | p q Error n Error m , P a | nential 
| of q | of n P 
| 70 years... | 112965 | — 12965 | + 03314 |—28°8747 + 7°3734| 3°7436| °1355 0045 
| 80 years... | 1°12152 | — 12152 | + 03349 |—14:0703 + 3°8704 | 1°7099 | °9358 | *1129 
| 85 years ... | 1:01903 | — 01903 | + :02902 | —43:2996 + 67°5797 | °8289) “9737 | “9715 
90 years... | 1°00654 | — :00654 | + 02934 | — 42°8498 |+192°3069 | -2801 6741 | 6672 | 
| | . | | 
Women. 
= , = | 
| Probable Probable | | Bi al Expo- | 
Age over Dp qd | Hrror n Error m eee nential 
of q | of n | ee 
| 
ei. Paice: 
70 years ... | 1°34012 | -- 34012 | + 04161 |—10°3522 | + 1°2307 | 35210) -8084 | -0000 
80 years... 1°20770 | - -20770 | + °03294 |—10°4400] + 1°8309 | 2°1569 -9686 70018 
85 years ... | 1°14507 | — °14507 | + 03077 |— 8°1447) + 1°9627 | 1:1816| ‘9860 | -1062 
90 years ...| *98125 | +°01875 | + ‘02779 | + 29-0573 +43:0634 | 5447| ‘9848 | -8116 
| | | 


of age living in the country, whose deaths are likely to be announced in the Times 
when they occur. Further the probable error of qg is such that actually this case 
might equally well be a random sample from material following a negative 
binomial. Analysing our material we see that our first two cases of males and 
the first three of females are such that they could not possibly be random samples 
from positive binomials, the probable errors of q are too small. Next, seven cases 
out of the eight do give actually negative binomials and the eighth might, having 
regard to its probable errors, well be a negative binomial. Thus although our 
daily occurrences are certainly in Bortkewitsch and Mortara’s sense “small numbers,” 
they give no support to the use of a Poisson-Exponential. 


If it be said that these “small numbers” differ in character from those used 
by our authors, the reply must be: we know in none of these cases the real 
population from which deaths are to be considered as drawn. The chances of 
death are certainly graduated with age, but the chances of suicide are graduated 
with temperament, and the same is true of alcoholism, or again the chance of 


70 On the Poisson Law of Small Numbers 


death by accident is graduated with occupation. At any rate until those who 
support the use of the “law of small numbers” demonstrate its application on 
material, where the probable errors are sufficiently small for us to measure the true 
value of gq and n, no advance can be made. Nor until we have clear ideas of the 
population in which the chance is q, is it possible to assert that it may be used 
for the suicides of school children, and the marriage of uncle and niece, and must 
not be used for the deaths of aged people, which certainly occur in “smaller” 
numbers. 


In the illustrations of deaths we have taken, certainly the Poisson-Exponential 
is not the rule, although the distributions appear to approach it, as towards a limit, 
when the number of deaths approach zero. But our data which show the rule of 
the negative binomial appear to show it in no more marked manner than much of 
the data selected by Mortara himself indicate the negative binomial, although owing 
to the sparsity of his material his results are far more erratic and unreliable. Nor 
is Bortkewitsch much behind Mortara in the evidence he produces for a negative 
binomial being as reasonable a description—possibly owing to inherent lumpiness— 
as a positive binomial of these “small number” frequencies. 


(15) Conclusions. 


(a) The Poisson-Exponential gives a fairly reasonable method of dealing with 
the probable deviations of small sub-frequencies in the case of random sampling. 
When the average value of a sub-frequency is not more than 3°/, of a population, 
then Poisson’s formula suffices in most practical cases to determine the range of 
error likely to be made. Tables are given to assist its use. 


(b) The application of the Poisson-Exponential to various data by Bortkewitsch 
and Mortara has hardly been justified by those writers, for they have not tested 
whether the probability q is small and positive and the power n large and positive 
in the cases considered by them. When this is actually done, it is found that 
their hypotheses, having regard to the probable errors of q and n, are largely 
unjustified in the case of their illustrations. Even in such cases where it is 
justified, a binomial gives a better result as measured by the test for goodness 


of fit. 


(c) Negative binomials repeatedly occur and give just as good fits, where 
they occur, as positive binomials. In the illustrations taken by Mortara, the 
frequency 10 used is so small that it is not possible to assert that either positive 
or negative binomials are demanded by the data. Still the average p of his results 
is very significantly in excess of unity. 


(d) Mortara like Bortkewitsch cuts out of his data straight off all districts 
with, on the average, more than 10 cases in the year. But the g obtained from 
20, 40, or even 100 cases in a population of 100,000 is a small g in the sense that 
the resulting binomial is adequately expressed by a Poisson-Exponential. There 


Lucy WHITAKER 71 


appears to be no valid reason for such a procedure, except the experience that 
many such cases actually give negative binomials*. It seems to us theoretically 
unjustifiable to apply the exponential to 8 cases say in a district of 100,000, and 
not apply it to 12 cases in a district of 200,000. Actually p may be 1:4 in the 
first case and only 09 in the second. 


(e) We consider that the reasonable method in every case is not to start with 
the Poisson-Exponential, which screens the truth or falsity of the a@ prior 
hypotheses, but to fit a binomial regardless of the magnitude of p. The fact that 
quite as good fits are obtained with negative as with positive binomials suggests 
that a new interpretation of these cases of “negative probability” is requisite. 
Several cases of the interrelation of “contributory cause groups” which provide 
a series represented by a negative binomial (p—q)~” have been recognised f. 
A general interpretation based on a very simple conception seems needed for 
these demographic cases in which the law of small numbers appears far more often 
to correspond to a negative than to a positive binomial. 


This paper was worked out in the Biometric Laboratory, and I have to thank 
Professor Karl Pearson for his aid at various stages. 


* Can we cite in addition perhaps, the fact that existing tables of m*e~”"/x! do not extend beyond 
m=10? 
+ Pearson, Biometrika, Vol. tv. p. 208. 


THE RELATIONSHIP BETWEEN THE WEIGHT OF THE 
SEED PLANTED AND THE CHARACTERISTICS OF 
THE PLANT PRODUCED air 


2 


By J. ARTHUR HARRIS, Ph.D., Carnegie Institution of Washington, U.S.A. 


I. Inrropucrory REMARKS. 


1. In Biometrika, Vol. rx. pp. 11—21, March 1913, were published constants 
showing the relationship between the weight of the seed planted and the number 
of pods on the plants produced in twenty experimentally grown series of Phaseolus 
vulgaris. From the economic view point, number of pods is the most important 
character which could have been chosen, total weight of seed matured only 
excepted. But to the student of morphogenesis, or of the physiology of seed 
production, other characters are of equal interest, while the comparison of the 
correlations for various features must yield results of significance. 


The purpose of the present communication is the presentation of the constants 
measuring the influence of the weight of the seed planted upon the number of 
ovules formed and the number of seeds developing in the pods of the matured 
plant. 

These various relationships have now been worked out for a relatively large 
bulk of material. Altogether there are 29 individual series belonging to 5 
varieties, involving 17,953 plants, from which 119,192 determinations of the 
number of ovules and seeds per pod have been made. The reply to the possible 
suggestion that the expenditure of effort in the collection-and analysis of such 
masses of data is quite unjustifiable is twofold. First, a major portion of the 
labour involved was necessary for investigations not touched upon here. Secondly, 
there are many problems of morphogenesis and physiology which can only be 
solved by the amassing of large series of accurately determined biometric constants 
which when sufficiently numerous may themselves be the materials for statistical 
analysis. The data here contained are recorded in partial fulfilment of such 
requirements for certain definite morphological and physiological problems. 

The present paper is limited strictly to matters of fact; general discussions are 
reserved until further data—much of which is already available in a raw state— 
are reduced. 


— Te eee ee ee le 


——_ 


J. Av HRARRIS 


TT. 


MATERIALS. 


The first paper may be consulted for details not entered here. 
analysed are drawn in part from the series already considered for the relationship 


between weight planted and number of pods produced. 


73 


The data 


In addition to the White 


Flageolet, Navy and Ne Plus Ultra varieties already treated, several lots of 


Burpee’s Stringless and two of Golden Wax are available. 


III. 


ANALYSIS OF DATA. 


2. Data for Number of Ovules and Seeds per Pod. 


Tables III—VI, similar to those of the preceding paper, give in a condensed 
form the data for the correlations discussed. 


Table I* gives the correlations 


TABLE I. Correlation and Partial Correlation Coefficients. 
| | | 
Namber coro: Nummberc! Correlation, Partial Correlation, Partial 
F eight Weight | pare Weight | : 
Series of end Pos of Pods | eadiOvdlese| Correlation, and, Seeds Correlation, 
Plants Twp Examined | no mane ng pws 
| | | 
LL 1141 |--008+°020; 8043 026+°008; ‘027+°008 —°013 +4008 | — 013 + :008 | 
LG 182 066 + 050 806 1534023 | +140+°023 —-100+°024 |—:103 + 024 | 
GG 750 |-°368+°021) 6310 018+°008' -029+'008 ‘004+°008} ‘0164-008 
GGH 583 208 + 027 5251 045+°010) 01947009 024+ °010 | — 004+ ‘009 
GGH2 499 176+ 029 3502 0934°011;) -083+°011 063+°011} 049+-011 
GGHH 396 193 + 033 2656 — 022+ °013 | — -042+°013 —:-029+-013 |— 048+ 013 
GGD 514 "159 + 039 1438 ‘1074018; ‘089+4°'018 ‘O71+°018} -068+°018 
| GGD2 449 215 + 030 1227 0444°019;) °018+°019 ‘079+°019] ‘0624-019 
| GGDD 342 137+ 036 | 807 1014023} -092+°024, -089+°024} -076+°024 
A 1484 177+:017| 14029 010 + :006 | —"039+°006 = 007 + :006 | — 054 + 006 
| HHA 1271 1454-019) 11230 — ‘000 + 006 | — -030+°006 = 016 + 006 | —-014 + 006 
HD 1416 "129+°018| 5581 — 044+ °009 | — 067+ :009 — 049+ 009 | — 052 +:009 | 
| HDD 1204 121+°019;} 5449 — 029 +009 | — 065 +4°009 —-010+:009 |— 030 + ‘009 
DD 513 282 + 027 1827 098+ °016) -009+°016 0504-016} -008+-016 
| DDD 459 215 +°030 2018 0444°015|} ‘0004°015) 0464°015} ‘006+4°015 
| DH 670 258 + 024 5955 075 +009 | — ‘005+°009 = ‘076 +009 | — (013 + 009 
| DHH 565 152+°028, 5019 0454°010} *008+°010, ‘011 +°010 | —-025+°010 
OSC 530 150+°029| 2569 059+ °013} 032+ °013 | 031+:°013} -024+°013 
USS 680 155+:025| 6605 023 +008 |—-000+°008) -041+°008| -024+ 008 
OSH 361 129+ °035 3.406 ‘032+ :012| 0014-012} 0374-012] -020+-012 
| USHH 224 1434-044 1743 112+°016} ‘098+°016 ‘011 +016 |— -004+-016 
USD 312 195 +037 | 802 127+:023/ °098+:024; -071+:024| -067+-024 
| USDD 237 241 +-041 | 851 2384022} +175+°023|) °1314°023] -090+:023 
FSC 586 147 +027 2876 047+°013] -017+°013|} :089+°012} -073+°013 
FSS 868 098 + 023 7809 021+°008) *001+'008) -026+4-:008; -004+4-008 
FSH 475 "100 +031 4541 049 +°010} -018+°010 —-045+°010 |— -073 +:010 
FSHH 427 121 + 032 3837 015+°011 /—-013+°011  -0404°011) ‘0174-011 
FSD 428 130 + ‘032 1449 "060 + 018 | — 027+°018 —-019+-018 = +036 +018 
FSDD 387 144+ 034 1556 0B7+°017) 013 +017 | ‘047+ °017) 024+ 017 
| 


* The weight of the seed planted was weighted with the number of pods counted. 
Sheppard’s correction was used for seed 
weight, but not for the integral variates ovules per pod or seeds per pod, 


differ slightly from those of Table II of the first paper. 


Biometrika x 


Thus w and o, 


10 


74 Weight of Seed and Characteristics of Plant 


between weight of seed planted and ovules per pod, 7,., and between weight 

8 Pp per } 8 
planted and number of seeds matured per pod, 7s. The partial correlation 
coefficients, 3 


i= Li wo me wp Pr. po ae Tws — Trp pss 
p' wo P = ) DWS: = 5 2 5) 
v1- “wp Vl=r r ‘po V1 — Pop V1 — ns 


showing the correlation for weight (w) and ovules (0) and weight and seeds (s) for 
constant numbers of pods (p) per plant are also given. These require in addition 
to the correlations here given 7), 7p). and r,s, the correlations between the number 
of pods per plant and the number of ovules and seeds in these pods. Values 
of 7», are available from the preceding paper (Biometrika, Vol. 1x. p. 21, Table 
VII) and from a supplementary table giving nine additional constants*. For the 
reader’s convenience these are reprinted in this table. The values of r,, and ry, 
will be published in connection with another problem. 


The probable errors have all been calculated on the basis of the number of 
pods examined as V. There is considerable question whether the actual number 
of seeds planted should not have been used instead; the degree of trustworthiness 
of a constant is perhaps not greater than is indicated by the lowest number of 
actual measurements (irrespective of the number of associated measures taken). 
The point is not of the greatest practical importance for the present case, since the 
number of series is so large that conclusions can be drawn from the run of the 
constants as a whole and too much weight need not be given to individual series. 


A glance at the table shows that the correlations are low throughout. The 
suggestion naturally arises that some of the extremely low values may be due to 
non-linear regression. The regression straight line equations and the results of 
Blakeman’s test} are given in Table II. Here 7, 7 and the straight line equation 
for the regression of ovules and sceds per pod on weight planted (in working units) 
are determined by the conventional formulae. The final two columns give the 


values of 
vil se Mee 
ue: Miele =i" 


when €=7?— 7? and xy, = 67449//N. 


All the straight lines are shown in Diagram 1. The empirical means are 
indicated in all of the cases where it can be done without confusion. The slope is 
very slight and the agreement of observed and predicted means not very close, 
especially near the ends of the range, where the number of observations is small. 
There is, however, no clear indication that a curve of a higher order would describe 
the results better than a straight line. This irregularity is precisely what is to be 
expected in cases of low correlation. 


* Harris, J. Arthur, ‘“‘An Illustration of the Influence of Substratum Heterogeneity upon Experi- 
mental Results.” Science, N. 8. Vol. xxxvi. pp. 345—346, 1913. 
| Blakeman, J., Biometrika, Vol. 1v. pp. 332—350, 1905, 


~T 
Gr 


J. A. Harris 


TABLE II. 
Tests for Linearity of Regression. 
| | _ | 
Correlation, r, | Correlation-Ratio, | Regression | Blakeman’s | Blakeman’s 
Series and n, and Straight Line —_ Criterion, Criterion, 
Probable Error | Probable Error Equation Test Ad Test B 
x | 
a eel | 
For Ovules: | | | 
USS 0232 + °0083 0657 £0083 | -54230+4-0074 wv 3°720 | 1°688 
DHH 70445 + *0095 0788+ 0095 | 4:°9385+4°0257 w 3°431 1151 
USDD 2381 + 0218 "2978+ 0211 3°6886 + “LOOL w 11°096 2°102 
GG D2 0442 + 0192 1276+ ‘0189 4:7224+4 0137 w 3°152 1:967 
FSS 0209 + ‘0076 0403+ °0076 =| 5560640153 w 2°263 “754 
HH “0098 + :0057 "0661 + °0057 5°3600 + 0056 w 5°159 1°678 
For Seeds: | 
USS ‘0407 + ‘0083 0946+ 0082 | 3:5870+°0206w | 5"182 2351 
DHH 0111 + :0095 05414 °0095 | 4°1521+4°0106 wv 5573 1°869 
USDD 1313 + :0227 1932+ °0223 | 2°1840+-0940 w 8712 | 1°650 
GG D2 0794+ 0191 "1760 + ‘0187 2°4735 + 0846 w | 4181 | 2°529 
FSS ‘0261 + 0076 0499+ :°0076 =| -3°0712+ "0269 w | 2°793 931 
HH ‘0068 + *0057 0953+ 0057-42119 + 0058 w 8421 2°739 


Blakeman’s criterion has been applied in two ways, A and Bb. In the first the 
actual number of pods examined has been taken as V. In test B the number of 
seeds planted (not the weighted number) has been used in obtaining x,. If the 
first test be accepted as the proper one, it follows that regression cannot safely be 
regarded as linear. But there are two important points to be taken into account. 
The correlation ratio 7 depends upon the squares of the differences in means, hence 
it has always a positive value, which may be very substantial because of the errors 
of sampling when the number of individuals per array is small. Thus when r 
approaches zero 7 is limited by 7, the mean values of 7 for zero correlation*. 
Hence a test for linearity based on a comparison of 7 with a very low value of r 
may be misleading, Again, as pointed out above, the significance of both r and 7 
should perhaps be tested on the basis of the lowest number of measurements. If 
this be done, as it is in test B, there is found very little evidence for non-linear 
regression. Certainly, one. cannot possibly assert that the low values of 7, which 
is seen throughout these experiments, is due to the number of ovules (seeds) per 
pod at first becoming larger and then decreasing after a maximum is reached as 
one passes from the lowest to the highest grade of seed weight. 


The results of Table I are also shown graphically in Diagram 2. Here the 
relationships for weight of seed planted and number of pods on the plant developing 
are also indicated as a basis of comparison. The values of both ry. and r,s are in 
general conspicuously lower than the low values of 7,,. But very few of them 
drop below the zero bar; one is forced to the conclusion that there is a distinct 
though very slight correlation between weight and ovules and between weight 


and seeds. 
* See K. Pearson, Biometrika, Vol. vit. pp. 254—256, 1911. 
. 10—2 


76 Weight of Secd and Characteristics of Plant 


Consider in somewhat greater detail the signs and magnitudes of these 
correlations*, 

Of the 26 values of 7, only 4 are negative. The mean value of the 22 positive 
coefficients is +°0673; the mean of the 4 negative is — 0236; the mean of all 
(regarding signs) is +°0533. 

For the relationship between weights of seed planted and number of seed 
matured per pod, 7s, 21 constants are positive and 5 are negative. The mean of 
the positive coefficients is + 0502; the mean of the negative values is — ‘0303 ; 
for all 26 correlations the mean (regarding signs) is + ‘0348. 

Thus both correlations are (as is clear from the diagrams) unquestionably 
positive but very low. 

Apparently the relationship for weight and ovules is slightly closer than that 
for weight and seeds per pod, but the difference is too slight to justify any final 
conclusion. 

Consider now the question whether the observed correlations ry, Ts are to be 
regarded as direct biological relationships between the two variables w and o or w 
and s, or whether they are to be looked upon as merely necessary resultants of 
other interdependences. At present, the only other demonstrated correlation 
which might tend to bring about sensible values of 7», and rys is that between 
number of pods per plant and number of ovules formed and number of seeds 
developing per pod. Since number of pods per plant is known to be correlated 
with weight of seed planted, while both number of ovules and number of seeds per 
pod are correlated with number of pods per. plant, some correlation must be 
expected between weight planted and number of ovules and seeds per pod. If 
now the observed values of 7%. and r,s which are always small, are merely the 
necessary resultant of the relationships 7), 7po, ps, one would expect the partial 
correlation coefficients, ,?0, pws, to be sensibly zero. If these partial correlations 
are not sensibly zero, it can only mean that there is a direct (causal) relationship 
other than the one just considered between number of ovules (or seeds) and the 
weight of the seed planted. 


The partial correlations and the correlations are shown side by side in 
Diagrams 3 and 4. The lowering of the degree of interdependence between both 
weight and ovules and weight and seeds by the correction for number of pods per 
plant is clearly marked. In a number of cases in which the correlation coefficient 
is positive the partial correlation coefficient is negative. 


Thus only 4 of the 26 values of 7. are negative, while 9 of the partial cor- 
relation coefficients have the minus sign. In only 5 cases is ry, negative, but in 
11 of the series, the sign of 7s is negative. The mean values of the partial cor- 
relations are very close indeed to zero. Thus j7y.='0186 as compared with 
Two = 0533; pFws= "0099 as against 7, = 0348. 

* T have already shown (Science, N. 8. Vol. xxxvu1. pp. 345—346, 1913) that the LL, LG and GG 
series are open to question because of the lack of certain precautions in the cultures; while they are 


included in the table of fundamental constants to avoid any possible criticism of selection of series they 
will be left out of account in the following disctssions. 


Ci 


J. A. Harris 


26. 96 GG va 


*spe0s IO} IOMOT XIS Ot} ‘SeTNAO 


€6 G6 


‘spun Burytom ur payunjd spaas fo 14620 Af 


LG 606: 26ley Sin 2b Obs Sh Vl SL 


ol 


Wa 0 ne, Oreegee se 9 8G v 


Joy orev sour toddn xis ayy, ‘pajyueyd poss jo yysIea wo 


pod zed speas jo pus pod sed sojnao jo worssorsoyy 


‘TL NVYYOVIG 


9 


*spaas 40 sajnao fo Laqunu uvayy 


Weight of Seed and Characteristics of Plant 


78 


- -- - --HHN- 


“|- - --- - -SSd- 


6 dn 
: 


"(= soul, UayoIq pue sapoto £°".= saul] ULI puUe seToII0 
“‘SuIdo[aAep S[VNPIAIPUL Jo sol4staoqovIeYyo pue pojyUTTd speos Jo yYBIOM AO SUOVI[eIIoD UMOUY oY} JO UostIedmMog 


L=Saull WAG pure sjop prfog 


- - -HHDD- 


---- - -GDp- 


‘6 WVdOVIG 


GO-— 


00: 


gO-+ 


0G: + 


Go-+ 


*$7W0191J909 U01)V)aLL09 JO san)v4 


79 


J. A. Harris 


' 1 ' 1 ' 1 1 ' t 
7 m™momom-mdeededed«cdeereteeéeuzxzAz2zdg 
Nnnoannenrn nnn npnAnin Ss FPS dS = 
So Sos 2 YG Seas SB use Zz Ss Sts 
To Ge rite Se Gemeee higw. 4 a ae 
| ! | | 1 1 ! | ! | | ! ! 
\ | | { | | | | | ! 
| 1 | | 1 { ! | | 1 | 
| ' | | ) | | | | ! | | 
' | | ' | | | 1 ! n 
\ \ ! | I | | | | yi 
| ' | | | \ | \ | | | fel 
| | 1 { | 
| | | ) 1 | ! | | i ‘h 
| ! | \ ; | i 11 
| 1 ‘i 5 
mR | | f | i “y ly 
! | A | | ' | 1 My 
4 | R | ia 
| ] ‘ er ae a eq al 
| | | \ eZ \ | | rip 4 
| | | 1 \ eet ee ye fan ee 4 
cog | 7 \ 
| \ } ! ! ! ‘6 fey ais | 
I | ] | | | | ! ' Su | 
| | } } \ } ! ! ! 3 | 
| | | | | | ' 1 ! ' ' 
| | ' ' | 1 | 
\ ! | | | | | 1 I | ! | 
| | 1 \ | | | ! I | ! | 
| | ! | | | | 1 1 ! 1 ! 
1 1 | ! | 1 1 1 | ! | | 
| | | | | | | ! | | ! | 
\ | ] | i \ | 1 | j | | 
! | | } } vo tl | 1 i 1 | 1 \ 
! ! ! ! Wah] | l 1 ' i ! I ! ! 
| ! | \ \ Wi a! | | 1 ! ! | | I | 
I | | | iol | | | \ ! 1 | \ ! 
\ | ue ' ! | | ! | ! | 
| | | ] Po) | } l | | } | 
| | | | | | | | | | | | 1 1 H | 


m= saul, prjos {°"4= seul, WaxoIg 


- - - - GHN- 


- ---~- - --HHN- 
~~ =. --- ~~~ -GdDd- 


| 
| 
| 
~e 
| 
| 
| 
| 
| 
' 
| 
' 
! 
| 
l 
| 
| 
! 
\ 
| 
I 
J 
L} 
| 
! 
I 
| 


- - 2000- 
---~=~----- QDD-- 


OL:— 


cO:- 


00: 


GO:+ 


O03: + 


-yuejd aed spod jo aequinu 4ueysuoo roy UOTyRIeAIoo [eyed 9} YIM pod azod soynao pue payue[d pases Jo JYSIOM Jo UOTZBIeII00 Jo UosTIvdWOQ “g WvAoVIG 


*squaioyjaoa fo anjpvg 


Weight of Seed and Characteristics of Plant 


80 


' ’ ' ' ' L 1 | I ' | ' 1 ' f | A \ t i ' ! 1 1 1 
mal | saghemesl seolecs feuh a Fy = a = = 
ge Efe ee ee ee wee Se See See SS 
i = 1 S ¥ = ' | } = = S i S or = a i] 4 \ = N a 
| ! | | | | | | | | | | | ! | | ! ! | | I | i | 1 OlL-— 
| | | | \ 1 \ | | I ] | | | | | | | I \ | | | ! 
| | | | \ \ \ | | | | ! | \ | \ r | | \ | | | | ! | 
| | | | | 1 | | | | | ' | | | | 1 | | | | | ! 
| \ | f | | | } I \ | \ \ } ' | | ] | | | \ ! 
| | | | I! | | ' ! | | | | | | ! a | ' | | | l co-+ 
| | \ \ | | | | \ \ | | } 1 \ | ' | ‘ \ \ | 
! | R ' | | | | 1 } } } | | | | DAS \ | | | \ | I 
\ \ ll’ | | A | | | \ \ t Hl he IN | | ’ \ ! 
/,\ Lig tw \ | | ' Aas Cee, | 5 aN | ' \ 
L I 1 \ I | d | 00: 
Yo Vi iene ] Sey =, fi ] a iN Aiba 
joe \ / | \l | | | GN : ran | ay | | \ je db 8 
fel | a, \ Q \ | ei ee | 6” le | (ie | | \ Pn ot 3 1\ 
/ \ / AS \ / 
\y | i ise\e ] areal Re, Nee eal Lv tau | | ‘ ie Mile ae 
a‘ | ens | Ms f | | | : \ p--6 ! | \ hi | Mi / \ co-+ 
\ | ' loa | fj \ | | \ Cal , \ | | \ f \ y 
| 1 ! | | | I 1 | | \ls | | \ | 'e' | ! 
| | i | | g \ \ a ai 
/ ' \ oO 
} } } t Q f ;'! | } | \ } (ema | | ! 
| ! | | 1 TEX, | Teal ! | ! 1 | ! \ | | | | \ | | le 
1 ! ! eS ete De } ! | | } ] ' } \ i ' Ore 
\ ' } | oly \ | | ' | } | | 1 | ' | } 
| \ Y | ie 
\ } | \ | 1 ' I \ \ \ } } | | | \ | ] \ \ ' i 
| | | | I i i I | | ! 


f= sautt pros £*"4= souly uweyorg 


‘yuetd sod spod jo sequin yuvysu0o toy uoNeper10o [eyred oy} YI pod red spaas jo requInU puUe payue{d poos Jo yYSIAM Jo UOYRI[eII09. JO UOsTIBdUIOg “F NYWOVIG 


‘squarayjaoa fo anjv4 


81 


J. A. Harris 


| | i} | 
= a a a = | | 13 | | 24 | (es) 929.—008. 
| al iz | €9 9IL | 03 | (@e) 008.—¢LZ. 
5 aoe ace : 3 : | LL | 961 | €3 | (Ze) $£L-—04. 
ge eae ema ee eal 1 ge | Gia | ae eg ae lp | B6E | 86L | LPI | (08) 094.—GéL- 
| | = | : ZeG | BEE | BST | (66) 9eL.—00L 
<cril hart Gee cam (eizacalt (oat = GE ag , = ae PSS 26. 8a) 00L:-—EL9- 
= le ef ba == = WG ZL |G = =| = 5168 14 «|G &F ro «6 Tg | BIST | g@ | (Zé) ¢49.—0g9. 
an ee et alee AG £7 | 8 €% 66 | 8 PL 16 81 | 2g OL | It | TOT | 49T | O€ | GEL | OOST | 9Gz | (94) OG9.-—¢ 
= om al Ode 8 | te al = ae ee SOL | 6T | 06 | €46 Gh | SFIT | Bhs | 9OF | (2e) G69.—009. 
= ay ly eer A96 PST | of | 69 6GI | PS | SIL | SEL | 8S | GST | OGL | Le | PFE | GOP ZB | IGHL | Lees | Est | (7s) O09.—<Ez¢. 
PP 29 | IL | 69 6IL | # | ¢9 Q9IT | FS | OLL | 18% | HH | OFZ | COE | BE | GZ | BEIT | FOS | 6ZOS’ | OOS | €89 | (Fe) SLe.—OEE. 
rat Sl 6 941 | 166 | 9¢ | PIE | BES | LOL | FIG | OF9 | OSI | 86G | STS | SHI | E9ST | GOLT | TOE | SLITS | L1zZP | FeL | (ZB) OG¢.—EeE. 
ee 9¢ | IL | 188 | GLb | G6 | sce | 969 | BTI | 848 | OSIT | BIZ | OZOL | 9OET | TEs | ZOBT | GehS eh | 69E% | BLOF | IBZ | (Te) Se9.—008. 
GL 6II | 93 | eZE | EE | GOL | 9GE | SFO | BSI | SZFI | OBBT | BGE | GIST | OFGI | ZOE | ZOLS | 9B9E FO | FEST | LHTE | TE | (02) ONG.—ELF. 
LOL | 146 | LG | PEO | wee | 68T | 8zG | F98 | OST | 86ST | Z6GT | TEE | T99T | LETS | EOF | BBZS | GEG SIG | SETI | 6rZS | FLE | (67) Heme 
ZIG | Il€ | #9 | EFS | G98 | OLT | OTF | 769 | OFT | GLOG | BGgs | E6F | BIOL | 9FIZ | LEE | 9ZOS | Gee TOF | BOET | HEGs | SIP | (ST) ¢ 
Ore | LIG | 9OL | 89F | GOL | 9ST} FOS | 006 | 8BT | GRFI | ISBT | LEE | PeSS | GELE | 9B | LOTE | 6GOF | IZL | FZET | OBLE | FOF | (27) G 
€6€ | G9 | L21 | IPE | IhS | SOL | Sch | GOL | IOT | IGOT | G8ET | P9Z | ZOGT | LE0S | FOF | PGES | PZTE | I9G | BIE | E86 | LLT | (91) O 
Ter | 6IL | PST | 88S | BOF | 6 | OLB | Lb | GOT | EEL | 868 | ZAI | FEFT | BEST | L9E | GIST | EZ0S | SSE | 6Zh | 806 | FET | (ET Nee eee 
OSE | 9FG | ZIT | BIZ | 49€ | GL | 9EG | SIF | 48 | OGE | GE | GOL | OZE | ZEIT | 6zs | O€ET | EGLT | Lee | Ihe | OGL | GBI | (TT) OGe.—<see. 
ees | Oe | GL | LLT | 966 | 09 | 902 | LIP | 78 | Ere | PID | GL | EGr | BLG | GOT | OSE | BET 126 | G9 | ILT | 8% | (EI) 9e6.—008. 
LOL | 41 | 9€ | ¥g SIl | 3 | 26 LVL. \ SE = = |e TLS 0eG? | TOR” SLOT: eet 1.008 LE 8 (1) 4 008. 7 846. 
8% 6y | st | eg v6 «| 6L | PF 6 | ST et er INCL ACLG I TGs acOG IsGer a Pe. 12Se €¢ re 
91 we | ial r «| 81 pe | 8 yea € 9¢ z9 €Il | 261 | 198 | Lh | 1% If i 
&I ice | = eee amar inh 4! 8. |? as rT: 1906s epee (OO. | Pet. 1Gs == == eS 6) cee. —008: 
¢ Go Uli ley | ee —{—}|—|—!— |—|e |e |st-|ze |w le = | = |, — 1s) 008-—eur- 
9 8 z 2 oa : 5 | | - |(4) $L0-—09T- 
= 9 = | | (eo eoa Ze 9 = — | — |(9) 0g1.—géT. 
Pace > | So Sean a = = tyes ala Mtr at ly Glee = Sh, Ae CO) RC a Te OOK: 
| ag ee - | | (1) O001-—¢L0. 
aaa ei = | | ee SY Cac: 
| = | | | | 
spoag |seTAg) speeg soTMAg speeg | seTnaQ speeg | se[nag speeg |sernag speeg |seynag. speag |setnaC 
eon og, “POT peso, | wwsog, “PPE | peso, | weyog, | “PA | qeaog, “ero, | POT] pequz, | reso, SP°E | pear preog, “P°F) pero, | rein, | PPE | poquera poog 
eh 3 
Ad)p/ seg ZAIN Satay ADD ser1ag HHDD saveg HDD SaMag HDD Serag DD seueg ee 


SOR MECMESRAD 


1 


Biometrika x 


Weight of Seed and Characteristics of Plant 


| | | 

= =o —= | FI € == = == | 61 CE pf = = | Ay 09 OL | gL SOL | 0G = = | = IO C= 
9 8% 9 |8¢ ZOL | && = aS OG €F ja an ata = = li Oy cg | OL | al | SOT | O8 GF 24 OL | (sé) ¢4¢.—0¢¢. 
SI It ge Lee SiL a = =|) fe Ee OL | = = — | 02 CP OL | 6FS | SEE | O09 TP F¢ OL |(é@) 0¢¢.— Gee. 
09 161 | GS |16@ | OOL | FET etait eecg | semaine BE 8. idh = — — | 9rr | Gtg | OIL | FZG | OTL | BSL, | SOL | Gs | GE (Te) Se¢.--008. 
‘0g r9 | S&L |eSel| 14% | PEG = oa = | Gy O€l | 46 | TE 1¢ OL | 68 | 66€ | 9, | ¢g9 | G06 | O9T | 80E | Oar | 84 |(04) 009.—FZT. 
SEL | OLE | Z |E9FL) 9886 | F9G = = — | 68 IST | I¢ | €6— | 691. | I¢ | 64h | 209 | GIT | BIG | 68ZL | 6ze | 68E | GEG | 76 |(6Z) SL7.—OGF. 
gt | 9EF | ZB | OtES| OOFF | F98 = ol oy, 9e1.| 8c | 981 | EPS | HH | IIS | LTO | OBL | OSL | 686 | ZBL | Ser | 8G | POT |(Sz) OG%.—<er. 
191 |. 46 | LL | Gees] SotF | 188 = — | — | 96 | GAT] 98 | BOB | Tee | GO | Heo | HHL | HHL | T6BT | 99BT |) GEE | L8G | GGL | PEL LT) G6%-— 007. 
€96 | €09 | GSI |10GE} 6GL9 | LI€T] FT 9% G BIT | 0€@ | Lh | BI8 | FELL | SIZ | FOOT | 9BET | 49S | YBOT | FZEE | IP | LOST | FOOT | HES |(9T) 0O4-—FL 
PLE | LG9 | FEL | PEES| ZO9F | €z6 | 08 IGL | GZ ZOE | 089 | SPL | OLIL | SFE | OBZ | 9646S | F16G | 18¢ | TEOE | B8zr | TLL | LPFL | GS6I | BPE |(47) ¢ 
I9L | Zlgé | 99 | 988s] Goer | L416 | LET | 98% | Ge PEF | ZOL | I9T | SLST | 6L0G | 96E | ZLOB | L69S | IGG | OGIF | LET9 | OZII | GOSS | 096G | GES TT) 04 
Sze | COF | 148 | TFOL| SZIE |0Z9 | 66% | ZOF 26 LOG | ZO | GEL | FOBT | LOT | BIE | EL9B | BFVE | 899 | LIL | 6899 | LOZI | OF8I | B8FS | FHF ($1) & 
ect | 90¢ | 19 | L2z1| eres |oor | Bae | LPL | BPE | 98% | OZ | ETE | 449 | 1e6 | BLT | OGGT | BeOS | 96E | BOSF | GIZO | YETT | GIST | LZ9T | 86e | (67) 01 
19 zt |9z2 |e29 | 90et |79% | 269 | 8I0oL | 916 | 62 | z2eL | ze | 8¢9 | $8 | 291 | GOOT | ZZ | TEs | g9zw | ese | Lgg | IIS | €OL | OBL \(TT) eLé. 
cL Lee ied, 16Z | €€F1 |98% | LOG | 19L | O9T | OT | Ee L FOL | PET | 8G | OLE | 9L4F | €6 | O49 | LG6 | OAT | TB 981 |G |(OL) 
Il 16 | & ele | %9 | Zot | @2% | LOV | 16 6% FG Il mae = =| ay 0¢ OL | F8% | 10F | EL GL. &6 SL |(6) 
i 2 | [193 | 97 | 06 Ig | SI | ge | S60 | eS) |) OL a TN ia So OG er Obes or thecal (2) 
OT 1 G 1e 2g. ST. 1g £8 LT = = | 6E 1¢ OL = a a it seal) 
eel. || oo = | OG a PO Tt Tee LG L . | >= Is} «OT € |(9) O&¢1.—96T- 
= = : = iB | 8 G |(¢) &6I-—ooT- 
— = — = = a = | — |ct | 26 |¢ |\(%) 001-—¢L0- 

= | _| c -| | eI | = = 

| i | 
Speag | saTnag spod speag | seTNAGC, spog spaeg |seynag spog spsegq ‘sa[nAQ spog spaeg |seTnag spog Spoeg |se[nag | spog Spe9g | SeTNAg | spog spseg SIMA spog | 
TPIOL| TRIO, Teqoy, | TeIOL TROL | TOL TBIOL | [4ON, | TIOL | Tei", TOL, | TOL TSIOL | TOL THIOL | TOL payueld peg 
: Saeaearae aianaal a =e |e = = : : jo 
D'T sated TT sattag dIdsn sears dS sees HHS) seg HS) sates Ss sereg DSN selteg ISIE AL 


“AI WIV 


83 


J. A. Harris 


| | | | | 
lab = agit ce | = | ep | 6S OL | = | (GL) GEE-—008- 
== —\ = IST PST 42S 1G rer 0s G6I  18@ | 0¢ 6FE | 69 66 | OFT | OSL | O& | (@I) 00e.—FLé. 
id G | |68t | 796  9¢ | G6E | EEF | T6 LOOT | OSE | 9FZ | FSOT |EBLT | FIE | O99 | LEG | SGT | (ZT) $4é.—o0ge. 
991 | OIG | IF | 129 | LFL | GFL | Beez | 966z | OFS | OGEz | S6FE | FZO | GO9E | Z9TO | LLOT | Z9GT | GBIZ | FEE | (OF) 0Gé.—géEe.- 
069 | F8L | GGT | COST | OGLT  19€ | 68ZG | OOS9 | OBIT | I8¢g | EOL | BHI | E98G | E8OOT| ZOLT | GZEE | GLP | F9L | (6) Gee-—00E. 
L8ET | GLLT | OGE | OOLT | EGFS FEF | E899 | LEZ | OGFI | FESF | ZOTO | LGOL | OOTL | ZEST | B8IS | 99FE | O66F | zB | (8) 00G-—EALT- 
BEES | INGE | 9L9 | TS8 | OOSL | SFZ | OE9T | 9006 | GE | EEE | OLTF | ISL | FOLF | LEFS | OGFI | GEO | 9882 | O17 (4) GLT-—OST- 
SOIL | O€FI | 98% | Gzz | e6z | 19 | O6F | BIO | SIL | FAT | E413 | lor | LTS |9PSE | 9E9 | 908 | SEIT | BET | (9) Og7.—<SeT. 
GSI | FOS | OF | LOL | SEL | O€ | ¢8 ik |) Xe 98g | 99F | £8 FI9 |FSOL | F8L | G6z | OZ | OL | (e) SéI-—OOT. 
81 CE 57 FZ cr | 6 SF 6¢ OL LP 99 | €L Ol@ |6EE | 6¢ 61 LG ¢ (f) 00 
au jar ie |g 06 «| F = | | 93 |ce |¢ |e) ¢20—090. 
| | | | | 
| | 
speeg |soqnag| Speeg | seTnag spoeg | sayna | Speag |sayna | spaeq | sana | spseg |serna | 
TRIO], | TBO], | Pd | Teqog, | eyog, | POT) peqog, | ei a len Teor, 1s “Teroy mor rps Tmo mon, ae poqurtd paeg 
on Me eee — i O 
WIdS a Satr9: 5 reRIOM 
TAS Setteg USI SelIeg HHS sates HS seveg SST Seteg OSA Seueg 
| | 
TA WIV 
2G ecg pee es Se hae ORs, 8h wy 6 =| | SOOT oes: 
Bai ae | Bo) ee ale ; 6 | Go |e |eOT |9Zt |08 |rSt |ree [47 | (G7) o26.—0S8. 
| = 96 | PIL |6L |6IT. | S6L. |6€ |Z9L |Fee |Or |F4s |Ier | es (FT) O98. see. 
BOURIGSLC® | ee ee 1608 cee | eT | Nit | == Rem Werrenies "iiece* | egy ALL: | Sees iy | 8h VevOl | Tlel A Gre mlelees 
vee | Gv |I8 (98 | BOL |OS |6cl | FET | 6E {ET | SI P I8¢ | LFL |9FL | G9zL| LFEL | ZIP | 890% | £09 | 99F | SegE | TLgF | FOG | (ZT) 008 Che 
6891 780 |¢6e | cor | 147 |98 |zz¢ | oL9 | OFT |OLT | Tes | Ge | LEFI| SGeL | FEE | PEGs] ErEE | 9S8 | GES | FFEL | ELET| 8668 | SZSIT | 180G | (77) seo Tb 
8467 | 8009 | GOIL| F8PT| GET | Zee | 9OGT| ORGI | OZF |6EG | GEL | GOT |G8LZ| BOLE | COL | OGLE] GGLE | EST | OSLTT | LTOFT | TG2E | GOLET | SEILT | ASTE (OL) 096 
E969 | OGGL | SLFL | F9SP | EGTS | G86 | BBIS| L06G | F19 |ETIL| TOST | See | LE8F| EEP9 | GOT | VEGF | LEFL | ZEST | BOLFT | CEBLT | FEE | GIZIT | €9G0G | LLLE (6) 9% 
€909 | GFEL | PIPL | 0964) FBLE | LOGL | CF8T| ZFS | FEE | ZELT| BOFS | ZOE | FLIG| G89 | SLET | LBzE| 18LF | LOOT | 8086 | GEEET | 9LGS | ELBOT | SLOFT | 109G ($) 006-—@LI- 
LGPL) I9LT | GPE | S60L| 8F98 | O69T| SIL | O16 | G6 | L0GT| LZFO% | OS | ITE} GZIS | S66 | G06 | 98ET | 16S | Tere | eeEr | Z6L | ILLe | F8Lr | O88 | (4) G47-—OST. 
Teh | Geo | 801 | PLrS | E1Ze | ZrO [ect | Fes | Ly | LL | 189 | IGE |6TET| 9OLT |6rE | ZBI | 64a | 2G |9TE |6IF |92 |698 |FGIT | Ges | (9) GOT 
LF TG j|OT |/948 | 9801 }ZIZ |8¢ | 08 | AT |G | 692 | G9 |e61 | Tea |7e | 26 |B |OL = AS LOL |€@ |(¢) $8I-—ooT.- 
cam ee — |60¢ | sce |1¢ Jor LT Foe eon eOsel OGaalaCia mG Sa |e See | = = Al 1 G  |(%) 00T-—¢Z0. 
8 8 j REO Sa RSE Wee Wh aie gl hc cemie Nfpoere 0 aa Attest ert RO belle fi f 8 z — |(§) $40-—090. 
| ; | 
Sp9ag | seynAC Spaeg | sarnag spaog |sayua speeg setna | spaeq | sayna spoag |seyna speed | satna e speed | seyna 
rerog,| peyog, | POF | teqoz,| pegog, | POF | eyo hn, EPOd. Teron lain eet ea “nae, seed Gon on, ee OL Teor, a mor mod, Ped) neque paog 
: | | 
L | ‘a 
HHG SoLtag Hd SOTLlag ddg S9IIeg dd Sollag ddH Seltag dH Sel1ag HHH Seltag HH Solteg FUSION 


ll 


‘A ATEV 


84 Weight of Seed and Characteristics of Plant 


IV. RECAPITULATION. 


The facts presented in this paper and in the preceding studies justify the 
following conclusions. 


1. In Phaseolus vulgaris there is a sensible relationship between the weight 
of the seed planted and the number of pods on the plant developing from it. The 
correlation is always low, averaging only about °166, but under proper experimental 
conditions the coefficients have always been found to be positive. When experi- 
ments are not made with all necessary precautions substratum heterogeneity may 
completely obscure the influence of seed weight, reducing the correlation to 
practically zero or even bringing about a substantial negative correlation. 


2. There is also a significant positive correlation between the weight of the 
seed planted and the number of ovules and the number of seeds in the pods pro- 
duced by the plant developing from it. These correlations are so low that on 
relatively small samples negative values may be found. They average only about 
one-fifth to one-third the magnitude of the correlation for weight planted and pods 
per plant. 


The relationship for weight and ovules is numerically higher than that for 
weight and seed, but on the basis of the number of series now available the 
difference cannot be asserted to be significant. 


3. Morphogenetically and physiologically, the observed correlations between 
weight and ovules and weight and seeds are to be regarded as the resultant of two 
other correlations, namely, that between the weight of the seed planted and the 
number of pods per plant and that between the number of pods on the plant and 
the characteristics of these pods. This conclusion is based’on the fact that the 
partial correlation coefficient for weight of seed planted and number of ovules or 
seeds per pod for constant number of pods per plant is practically zero. 


CoLtp Sprina Harpor, N.Y. 
August 20, 1913. 


ON THE PROBABILITY THAT TWO INDEPENDENT DIS- 
TRIBUTIONS OF FREQUENCY ARE REALLY SAMPLES 
OF THE SAME POPULATION, WITH SPECIAL REFER- 
ENCE TO RECENT WORK ON THE IDENTITY OF 
TRYPANOSOME STRAINS 


By KARL PEARSON, E.RS. 


(1) In Biometrika, Vol. vin. p. 250, I discussed fully the mathematical 
process requisite for measuring the probability that two independent distributions 
of frequency are really samples of the same population. As far as I am aware this 
is the only complete theory of the subject which has been published. I believe it 
to be scientifically adequate, and it has already been applied to a large number of 
problems*. 

Before that paper was published, it had been usual to compare any constants of 
two frequency distributions together, and by a due consideration of their difference 
relative to the combination of their probable errors to determine the probability of 
the identity of those constants. This could be repeated for any number of corre- 
sponding constants, and if theoretical curves of frequency had been fitted, their 
divergence or correspondence measured by the divergence or correspondence of 
their complete series of constants. The method above referred to, however, as 
based on the general theory of sampling, calls for no hypothesis as to the general 
theory of frequency. It takes the observed distributions and measures the prob- 
ability that both are samples from a large population. The population may be 
homogeneous or heterogeneous; provided the samples are truly random samples 
we obtain a measure of the probability of their common origin. 

In the course of a long statistical experience I have learnt that it is wholly 
impossible to reach any safe conclusions as to the identity or non-identity of 
populations by any process of mere graphical comparison of frequency distributions. 


* In actual practice the x? test of ‘‘ goodness of fit’’ should always be made with not too fine group- 
ing at the terminals, especially when any group in the tails appears to be contributing largely to the 
total of y?. This point was recognised ab initio (Phil. Mag. Vol. u. p. 164), and has recently been 
re-emphasised by Edgworth, Journal R. Statistical Society, Vol. uxxvu. p. 198. 


86 A Study of Trypanosome Strains 


The distributions in appearance are wholly dependent on the choice cf scales 
and the eye alone cannot possibly make any measure of the degree of accordance, 
which will have scientific value. 


In the accompanying Diagram I. for example, we have the frequency distri- 
butions permille of two strains of trypanosomes, (aa) from a Donkey and (bb) from 
a Hartebeeste. These we are told are identical. Below (cc) and (dd) are given the 
frequency distributions of head-breadths for two races, Egyptian and English women, 
separated by 7000 years interval. These strains we know to be different, but the 
eye that judges (aa) and (bb) to be the “same” * might well suppose (cc) and (dd) to be 
also the same. Actually when we come to the quantitative measure of divergence, 
the probability that (aa) and (bb) are samples of the same thing is P < 000,000,1, 
while the probability that (cc) and (dd) are the same is P=‘001. In other words it 
is 10,000 times as likely that Egyptians of 6000 B.c. and the English of 1680 a.p. 
are the same strain as that the trypanosomes from the Hartebeeste and those 
from the Mzimba Donkey are of the same strain. Both may indeed be of the “same 
strain” if a sufficiently wide meaning be given to the term. But is such a racial 
resemblance as we find between the Prehistoric Egyptian woman and the English 
woman diluted 10,000 times what we understand in ordinary language by the “same 
strain”? All the mathematician can understand by “sameness of strain” is the 
identity which corresponds to random samples of the same population. Ifthe identity 
has been modified by a long evolutionary process, by markedly differential environ- 
ment or treatment, is it not better to have some measure of a scientific nature of 
the extent of the difference or of the sameness? The eye can never provide any 
judgment of value on such a point. Especially is this the case if the graphs 
represent percentages, as the degree of divergence is of course a function of the 
number employed to determine the percentages. A deviation of frequency by per- 
centages based upon samples of 200 might look to the eye absolutely like the 
deviation of frequency due to samples of 2000, but the scientific measure of the 
probability of sameness would be widely modified. 


That the reader should have evidence how excellent is the test, I have taken 
the cranial lengths (Flower’s measurement) of 67 female skulls dug up in Liverpool 
Street and compared them with the like lengths of 142 female skulls dug up in 
Church Lane, Whitechapel. It is possible that both these sets of crania formed 
part of the contents of plague pits, or there may be an interval in date of a century 
between them+. Diagram I bis shows the data arranged as percentage frequency 
curves. The x? for 17 groups proceeding by 2 mm. ranges = 19°38, giving P = ‘250, 
or once in four trials, if the material drawn from were the same, we should obtain 
pairs of samples more divergent than the pair recorded. In other words we can 
be confident that the Liverpoo! Street and Whitechapel crania represent persons 

* An attempt to define the word ‘‘sameness” as used by writers on trypanosome strains would 
doubtless serve a useful purpose, and emphasise the fact that we can only define ‘‘sameness” by appeal 


to the theory of sampling, or by the adoption of some quantitative measure of the grade of likeness, 
+ See Biometrika, Vol. 11. p. 191 and Vol. vy. p. 86. 


Kari PEARSON 


MICRONS. 


15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 


els SeReRa es 


jHRUBERSAEEESRERRRREED 
Sone oe e 


\ \ 
aera HE THEE 


aa, Mzimba Strain of Trypanosome, 


bb. Strain frcm Hartebeeste. 


MILLEMETRES. 


c. Cranial Breadths of Egyptian Women 6000 B.c. 
dd. Cranial Breadths of English Women 1680 a.p, 


Dracram J. 


88 A Study of Trypanosome Strains 


MiLLEMETRES. 
170-5 180-5 190-5 


a 


BEB Ae 


ofA a FS 


cc. Whitechapel ¢’s. 
dd. Moorfields ?’s. 
Diacram I bis. 


of the same strain. That P= ‘25 and not, say, ‘85 may be merely a result of 
random sampling, or it may arise from some difference of period or social class. 


(2) In a long series of papers recently published in the Proceedings of the 
Royal Society, Section B, conclusions are reached as to the identity of various 
strains of trypanosomes. These conclusions are largely based on a comparison of 
graphs of the frequency percentages obtained by measurement of hundreds of 
trypanosomes. 

To some extent mean values are given for the different strains, but no argu- 
ments whatever can be based on them, for in no case has the probable error of the 
difference been calculated. Even if it had been calculated, this constant alone 
would not have sufficed to determine the sameness or difference of the strains. 
Further, the percentages of various forms in the strains are sometimes given; 
but again no attempt has been made to determine whether the differences of 
these percentages are or are not significant. It seems sufficient here to consider 
the far more valid test of the sameness or diversity of the frequency-distributions 
as a whole. 

I shall divide my investigation into four parts: 

(i) The probability of identity of the strains on the evidence presented in the 
reports of the Commission of the Royal Society, Nyasaland, 1912. 

(ii) The probability that the host or the animal in which the trypanosome is 
cultivated makes essential differences in the distributions of frequency. 


Karu PEARSON 89 


(iii) The probability that the strains are alike after allowance has been made 
for the host. 


(iv) The nature of the heterogeneity which is statistically demonstrable in 
the bulk of trypanosome measurements. 


I should like before considering the material to indicate one or two very 
important points. I am not concerned here with the truth or error of the con- 
clusions drawn by Sir David Bruce and his collaborators. I am only concerned 
with the nature of the process by which they have drawn their inferences. That 
process consists in a measurement of the individual trypanosomes and an appeal 
to the statistics of these measurements—in short to what I should term biometric 
reasoning. There may well be other means of discussing the resemblances of the 
different strains of trypanosome,—either by microscopic examinations of diver- 
gencies in the life history of the different strains or by differentiation in their 
action on different hosts, or otherwise. But in the present case the appeal to statistics 
of measurement has been made. Drs Stepbens and Fantham in their paper on 
T. rhodesiense (R. S. Proc. Vol. 85, B. p. 227) actually term their work a “biometric 
study,” and the later papers of Sir David Bruce and others are no less “ biometric.” 
Now if an appeal be made to statistics, then by a statistical method alone can the 
answer be given. Further, that method must be the analysis of the modern fully 
equipped and highly trained statistician. Such a statistician, and he alone, can 
assert or deny on the basis of statistics the probability of any of these strains 
of trypanosomes being samples of the same population; he alone is in a position 
to judge the value of the evidence provided by the frequency distributions. If he 
finds substantial “divergence” where Sir David Bruce and _ his collaborators 
assert “sameness,” then either statistical theory is wrong, or Sir David Bruce 
understands by “sameness” something quite different from the “sameness” of the 
statistician, and something which cannot be judged by the methods of statistics, to 
which accordingly no appeal should have been made, or only an appeal after a long 
series of control experiments. The “sameness” postulated by Sir David Bruce is 
something quite incompatible with the “sameness” found by the statistician when 
he investigates two samples of 100 crania of the same race or two samples of 1000 
blood corpuscles of two series of frogs of the same race. It is what the statistician 
calls marked divergence and not sameness. If it be asserted that the extreme 
divergence actually existing between the strains of trypanosomes statistically dis- 
cussed is due to difference of individual host and not to difference of strain, it will 
be clear that the divergence and not the sameness ought to have come out of the 
statistical investigation, and then control investigations ought to have been made to 
explain that divergence by environmental or other differences. But this is & priori 
to assume the identity of the strains and a posteriori to seek an explanation of 
marked divergence deduced statistically, whereas in the actual papers this great 
divergence is assumed to be statistical sameness and this sameness used as an 
argument for identity of strains. The statistician coming to the data critically 

Biometrika x 12 


90 A Study of Trypanosome Strains 


does not of course assert dogmatically that any two strains are not of identical 
race. What he does assert is that no argument for the sameness of the strains can 
be based on the statistics provided; for these actually show wide divergence, and 
he asks if the strains are @ priori assumed to be “same,” for a full @ posteriort 
examination of the sources of the divergence. 


The scope of the present paper is not the complete investigation of all the data 
of the Royal Society Commission, nor an endeavour to obtain from the published 
data the full conclusions which may be legitimately drawn from them. Its purpose 
is to illustrate the statistical methods which ought to be applied to such material 
and to indicate the essential necessity of control experiments on strains known to 
be the same or accepted as different. A point should be noted here, namely, that 
I have only found two cases where the strains on the basis of the statistical 
evidence are said to be different. The first is in the case of Trypanosoma evansi 
and Trypanosoma brucei. Sir David Bruce* gives (1911) the frequency distri- 
bution of lengths of 820 individuals of 7. evansi and compares it by means of a 
graph of percentages with 7. brucei. The percentages of the latter appear to be 
deduced from the lengths for two series of 160 trypanosomes and 200 trypanosomes 
cultivated in a variety of animals (Uganda, 1909, and Zululand, 1894) and pub- 
lished in the preceding yeart, but no reference is given in the paper to the 
original of the percentages in the graph, nor is any demonstration given in the 
paper of 1910 of the statistical sameness of the Uganda and Zululand strains— 
there is merely said to be “ marked resemblance{” where the trained statistician 
finds marked divergence§. Stephens and Fantham|| use the curve of 1911 to 
assert that there is a “general resemblance between the curves representing 
the measurements of these trypanosomes (7. gambiense, T. rhodesiense, T. brucet)” 
and consider that this “general resemblance” shows that “the method is a 
trustworthy one.” It is not clear what “the method” referred to really sig- 
nifies. The statistical comparison of means and maximum and minimum 
lengths without statement of probable errors, and the mere graphical exami- 
nation of frequency curves are wholly inadequate to determine sameness or 


R. S. Proc. Vol. 84, B, p. 186, 1911. 
|+ R.S. Proc. Vol. 83, B, pp. 5 and 11, 1910. 
R 


| 


+ R. S. Proc. Vol. 83, B, p. 12. 
§ The two distributions are as follows: 
oa | 5 fe 
|} 13) 14|}15| 16) 17) 18 | 19) 20 | 21) 22) 23) 24| 25 | 26 | 27 | 28 | 29 | 50 | 31 | 82 | 83 | 34 | 85 | Totals 
(hers jth, elie lice ee i 
Uganda, 1969 |—|—] 1, 2} 4} 6/10/26 14/14/12} 9]12) 6} 6/12/10) 7] 1) 2/3) 3)—/ 160 

| Zululand, eee a 3} | 11 | 11 | 20 | 32 ma 4} 4) 3] 5/3] 7] 7 |10/13/13] 8 |10} 8} 3]1] 3] 200 
| | | 


These give x2=101:18, leading to P<-000,001, or not once in a million trials would two so divergent 
distributions be obtained by sampling the same population. 
|| R. S. Proc. Vol. 85, B, p. 233, 1912, 


KARL PEARSON 91 


divergence. Only a month later than Stephen and Fantham’s paper appeared 
another paper by Sir David Bruce and others* comparing human trypanosomes 
from Nyasaland with 7. brucei and T. rhodesiense and T. gambiense. But the 
curve for 7. brucei is wholly different from that of a year earlier. Instead of a 
minimum at 24 microns there is now a maximum at 24 microns, and the “ general 
resemblance” of 7. brucei to 7’. evanst is much increased. We are now told that 
T. rhodesiense (Stephens and Fantham) is “a distinct species, nearly related to 
T. brucei and T. gambiense,” and the conclusion drawn that “the human trypano- 
some disease of North-east Rhodesia and Nyasaland is not the disease known as 
Sleeping Sickness in Uganda and the West Coast of Africat.” But the divergence 
between the frequency distributions of 7. brucei and the human trypanosome of 
Nyasaland when accurately measured is of exactly the same order as that which 
suffices to demonstrate the identity of the human Nyasaland trypanosome and 
T. rhodesiense. Thus the two cases in which divergence is asserted, i.e. (i) 7. brucez 
and 7. evanst, (11) T. brucei and 7. rhodesiense, seem to be differentiated largely on 
the base of unanalysed statistical evidence of a nature precisely like that which in 
other cases is interpreted to mean close “general resemblance” or “sameness.” 
We do not feel that we are in the possession of independent evidence of differen- 
tiation which would enable us to test how far statistical divergency corresponds to 
recognised morphological differences of strain,—a fundamental requisite if we are 
to interpret as “sameness ” a statistical divergence of an extremely high order. 


In concluding these introductory remarks we must refer to the types of 
trypanosome in Nyasaland recognised by Sir David Bruce and his colleagues as 
distinct on other grounds than numerical measurements. They are: 


(a) TZ. brucei vel rhodesiense. This is said to be the cause of the human 
trypanosome disease of Nyasaland. The modal length appears to be 24 to 25 
micronst. According to Bruce and colleagues 7’. gambiense appears to have a 
mode of 20 microns, but there is evidence for a submode at 26. 


(i) TZ. pecorum. This is said to be the cause of trypanosome diseases of 
domestic animals in both Uganda and Nyasaland. The modal length varies from 
13 to 14§. There is no statistical evidence of bimodality. 


(ii) 2. stmiae. This attacks monkey, goat and warthog. Oxen, dogs, white 
rats, etc., are said to be immune. The length distribution appears to be very 
homogeneous and with a single mode at 18 microns]}. 


* R.S. Proc. Vol. 85, B, p. 431, 1912. 

+ R. S. Proc. Vol. 85, B, p. 433, 1912. In 1913, however, we find that ‘there is some reason for the 
belief that T. rhodesiense and T. brucei are one and the same species,” see Sir David Bruce and others, 
R, S. Proc. Vol. 86, B, p. 407. 

+ R. 8. Proc. Vol. 84, B, p. 331. Stephens and Fantham’s measurements on 7’. rhodesiense 
suggest modes at 20 and 26. Ibid. Vol. 85, B, p. 231. The double mode—roughly 18 to 20 and 
28 to 29—appears in the Zululand (1894) and Uganda (1909) strains of T. brucei. Ibid. Vol. 83, 
B, p. 12. 

§ R. S. Proc. Vol. 82, B, p. 468, and Vol. 87, B, p. 14. 

|| R. S. Proc, Vol. 85, B, p. 477, and Vol. 87, B, p. 48. 


92 A Study of Trypanosome Strains 


Gv) ZL. caprae. This is found in waterbuck, ox, goat and sheep. The dis- 
tribution of length is apparently homogeneous and the mode at 25 microns*. 

I leave out of account several forms of trypanosome referred to by Sir David 
Bruce and colleagues, e.g. 7. vwaa, T. uniforme, T. ingens, etc., of which no large 
series of measurements were at my disposal. 


With the exception of 7. simiae, which occurs in the warthog, the above 
trypanosomes appear to be found generally in the wild game and all of them are 
found in the Glossina morsitans. Sir David Bruce and his colleagues suppose the 
differentiation into these classes to precede the consideration of individual strain, 
but the exact modus differentrationis is not clear from the memoirs. 


(3) Method of Investigation. The actual formula employed in the present 
investigation is very simple and can be applied by anyone able to do ordinary 
arithmetic. If N and N’ be the sizes of two samples and the corresponding 


frequencies : 
Joy Jas ise aise Tas eae Ss: 
as Us ws; wise) VEN ile. fess 


where fp, f, are the frequencies falling in the p™ category, then if 


(Io _ tv \? 
cou CEB 
| et Te, 
be calculated, the probability P that the observed or a greater divergence between 
the two series would arise from sampling the same population is obtained by 
determining P from y? by my method of testing “goodness of fit.” This method 
was first published in the Phil. Mag., Vol. 50, p. 157, 1900. The shortest method 
of actually determining P is by aid of Palin Elderton’s tables for P with argument 
x’ issued in Biometrika, Vol. 1. p. 155, 1902. This is the process used in the 
measurements of sameness and divergence provided below. 


(4) On the Probability of the identity of the Strains discussed by Sir David 
Bruce and others. 


(a) I take first the question of the “sameness” of the Wild-game strains of 
trypanosomes as isolated from five antelopes—reedbuck, waterbuck, oribi, and two 
hartebeeste. Sir David Bruce and others discuss these strains in a papert of 
February, 1912, and conclude, apparently from the statistical data, that ‘the five 
Wild-game strains resemble each other closely and all belong to the same species.” 


Now these Wild-game strains have a distinct advantage for they are all 
obtained from the trypanosomes ultimately taken from the rat as host; they were 
passed from the infected antelope through healthy goat, monkey or dog, which 

* R. S. Proc. Vol. 86, B, p. 278. 


+ R. S. Proc. Vol. 86, B, p. 407, 1913. In the Table p. 405 for 2500 trypanosomes under the 
heading 31 microns read a frequency of 33 not 53. 


Karu PEARSON 93 


became infected, to the rat. The frequencies of lengths of the trypanosomes in 
microns were as follows : 


From Rat 15|16|17|18|19| 20 


21 | 22| 23 | 24\ 25 | 26 | 27 | 28 | 29 | 30 | 31| 382 \ 33 | 384| 85 


Hartebeeste (1) ... 
Hartebeeste (2) ... 
Oribi 2 ee 
Waterbuck 
Reedbuck 


Mzimba (Donkey)| 
Strain i 


I questioned first whether the strains found in the two Hartebeeste were the 


same; they give 
x? = 108'69, and therefore P < ‘000,000,1. 


In other words not once in 10,000,000 trials would two such divergent samples 
arise if the Hartebeeste strains were samples of the same population. I now 
compare the Waterbuck and the Oribi; these provide y? = 109°25 and P <-000,000,1, 
and again the ewtraordinary divergence, not the sameness, is the statistical 
feature. The reader may rest assured that equally incompatible results arise 
when we compare the other antelopes. Statistically we are compelled to assert 
either that the trypanosome strains in these different antelopes were different 
species, or that, not only the infected species of antelope, but the individual 
antelope of the same species (as in the case of the two Hartebeeste) immensely 
modifies the strain of trypanosome. In short not the “sameness” of the strains, 
but their great statistical divergence is the fact which impresses itself on the 
biometrician. No biometrician could possibly accept the view of Sir David Bruce 
and his colleagues that* : 


“Tt is evident from these tables and charts that the various strains of. this 
trypanosome, as they occur in wild game are remarkably alike. This is what 
might be expected. Here the trypanosome is at home; it is leading a natural 
life. It may be supposed to be saved from variation by constantly passing and 
repassing between the antelope and the tsetse fly.” 


Our authors, it will be noted, directly appeal for “likeness” of strains to the 
tables and charts. 


With these immense measures of statistical differentiation, we ask : what would 
be the values of x? and P, if examples of differentiated strains of trypanosomes could 
be found? If differences of host or treatment can produce these wide divergences, 
how without a preliminary study of the same strain in different hosts and under 
different treatments can we be certain whether these large divergences mean the 
same strain differently treated, or different species of trypanosomes ? 


* R. S. Proc. Vol. 86, B, p. 406. 


94 A Study of Trypanosome Strains 


(b) The next comparison I make is between the Mzimba (Donkey) Strain 
taken through rats and the above wild-game strains. I have added the data for 
the Mzimba Strain to the last table (p. 93): it is given by Sir David Bruce and 
others in a paper on the Mzimba Strain*. I compare the Reedbuck and the 
Mzimba (Donkey) strains first. We find: 


xy? = 53°37, P =-000,05. 


Thus only once in 20,000 trials would a divergence as great as this arise, if the 
two strains were samples from the same population. 


The results of comparing the Mzimba strain with Waterbuck and Hartebeeste (1) 
- give respectively 

x= 11423, P= <-000,000,1, 
and x = 00, = = 000000R- 


These give for practical purposes impossibility of a common source, thus still 
further demonstrating that the marked feature of the wild-game and Mzimba 
strains is divergence, not sameness. 


Sir David Bruce and his colleagues writet : “The trypanosome of the Mzimba 
strain is the same species as that occurring in the wild-game inhabiting the 
Proclaimed Area, Nyasaland.” In an earlier paper a diagramt is given of the 
frequency distribution of 3600 trypanosomes of Human strain taken from the rat 
alone. These are drawn from four native cases of sleeping sickness in Nyasaland 
and from one European case from Portuguese East Africa. As the individual 
cases for the rats alone are not given, they have had to be read off the per- 
centage diagram, but the frequencies must be very nearly correct. This Human 
strain may be compared with the 7. rhodesiense, the T. brucei, the Mzimba 
(Donkey) strain and a strain obtained from a native woman suffering from 
“ Kaodzera,” the so-called sleeping sickness of Nyasaland. The frequencies of 
these five strains are given in the following table. I first compare the trypano- 
somes of Nyasaland given as (b) above with 7. bruce: and T. rhodesiense, for this 
is the comparison made by the authors themselves§. 


Taking the trypanosomes of Nyasaland (b) and the 7. brucei as figured in 
percentage curves by Bruce and others, we have 


y2=7217, P< -600,000,1, 


or it is impossible to ascribe any degree of sameness to these two strains. We 
now compare the Nyasaland strain (b) with 7. rhodesiense, and find 


x? = 69:95, P=-000,01 ; 


* R.S. Proc. Vol. 87, B, p. 31, 1913. 

+ R. S. Proc. Vol. 87, B, p. 34. 

+ R. S. Proc. Vol. 86, B, p. 301. 

§ R. S. Proc. Vol. 85, B, pp. 431 and 433, 


Karu PEARSON 95 


thus once in 100,000 trials two such divergent samples might be drawn. Although 
there is less divergence than in the case of 7. brucei and Nyasaland (5), it is idle 
to speak of such a degree of divergence as sameness. 


Length in Microns. 


LONI | TAN 1S VLGNLT)| LS) | 19 | 20°) 21.) 22) 23 | 2h |.25 | 26 | 27 | 28 | 29 

| Mzimba (Donkey) (4) ~... | —|—|— = | 2}14| 41] 91] 79| 56] 53] 38] 39] 22} 19] 16} 15] 9 
Human, Native Woman (b)/—|—| 1] 4/19) 42) 63] 81] 75| 91) 65; 66; 93] 91) 107/}110/104| 87} 
Human, mixed (c) ...|/—!—}|—]| 1] 4/46] 111] 159 | 219 | 288 | 312 | 365 | 359 | 314 | 314 | 231 | 218 | 198 
T. brucei... eae ..{-—!| 5 | 8/14/17) 40] 63] 55) 66] 63) 75} 87} 93} 80) 82} 72) 50] 38 
| T. rhodesiense >... ...| 1 | 3 |10]19|29/35| 67) 54] 92] 51) 74] 56] 68] 59} 85] 61] 72] 50 

| | 
| a c eee Se el 
Length in Microns.—(continued). 
| 380 | 31 | 82) 33 | 34) 85 | 36:| 37 | 38 | 39 | Totals Remarks 
a — sz 
Mzimba (Donkey) (a) ...| 2] 2/ 2 (=| - 500 | &. S. Proc. Vol. 87, B, p. 31. 
eer? | | Rats only. 
Human, Native Woman (b)| 49} 27/23/13] 7| 1) 1 |—|—|—| 1220 | &. S. Proc. Vol. 85, B, p. 427. | 
| Various hosts. 
Human, mixed (c) ... | 132} 125]}90/59/30/13] 8 | 2 | 2 | —} 3600 | &.S. Proc. Vol. 86, B, p. 301. Read | 
; | from diagram. Rats. 
TT. bruces... oe | 27] 26/18/11} 4) 4; —|—| 2 |-—| 1000 | &. S. Proc. Vol. 84, B, p. 331. 
| | Read from diagram 
T. rhodesiense sen Peamozipe2o (lid. 13) Oat | Lo) — a 1 | 1000 | & S. Proc. Vol: 85, B, p. 227. | 
| Various hosts. 


To further establish our point let us compare the Human strain (c) for 3600 
trypanosomes with the 7’. rhodesiense. Here y? = 325'47 leading to P < ‘000,000,01. 
In other words the great degree of divergence for the case of the Nyasaland native 
woman is exceeded at least a thousand times, when we take the big example of 
four natives and one European. 

Sir David Bruce and his colleagues write of these strains : 

“(1) The trypanosome of the human trypanosome disease of Nyasaland is 
T. rhodesiense (Stephens and Fantham).” In other words the P =-000,01 is inter- 
preted as sameness. 


“(2) This is a distinct species, nearly related to 7. brucei and 7. gambiense, 
but more closely resembling the former than the latter.” In other words they at 
this date distinguished between 7’. brucei and T. rhodesiense*, and as a result of 
this distinction proposed to call the human trypanosome disease of North-east 
Rhodesia and Nyasaland by the name “ Kaodzera” as not being identical with the 
sleeping sickness of Uganda and the West Coast of Africa. If we, however, 
compare 7’. brucer and T. rhodesiense we find y? = 46°83 and P=-019. In other 


* R.S. Proc. Vol. 85, B, p. 433, 1912. 


96 A Study of Trypanosome Strains 


words once in about 50 trials we might expect to get two samples from the same 
population as divergent or more divergent than the distributions found for 
T. brucei and T. rhodesiense. We have in fact in the cases of these two trypano- 
somes reached our first instance of comparative sameness, and the statistics should 
have shown Sir David Bruce and his colleagues that 7’. brucei and T. rhodesiense 
were relatively the same, and though both differed from the human trypanosome 
of Nyasaland widely, the approach to 7. rhodesiense was only slightly closer. 


The accordance—speaking in a relative sense—of 7’. rhodesiense and T. brucei 
was asserted by Stephens and Fantham in March, 1912*. In May, 1912, Bruce 
and others, speaking of the 7. rhodesiense, term it a distinct species; in February, 
1918, they say—although without publishing further frequency distributions— 
that “There is some reason for the belief that 7. rhodesiense and T. brucei 
(Plimmer and Bradford) are one and the same species,’ + and in a further paper of 
the same month, “Evidence is accumulating than 7. rhodesiense and T. brucei 
(Plimmer and Bradford) are identical{.” In May, 1913 (R. S. Proc. Vol. 87, B, 
p. 34), we are told that the Mzimba strain is identical with the wild-game strain 
and that “it has already been concluded that this species is 7. brucei vel T. rhode- 
stense.” As far as the statistics of the subject go the only really weighty evidence 
for the identity is that of 1912, on which, without statistical analysis, the 
distinction between the two species was asserted. 


(c) We will next consider the possible identification of 7. gambiense with 
T. rhodesiense and with T. brucev. 


The second identification is seggested by Sir D. Bruce and others in the words§: 


“Whether these slight differences are fundamental or only accidental it is 
impossible at present to say, but enough has been written to show that Trypano- 
soma gambiense and Trypanosoma brucet approach each other very closely in 
shape and size.” 


The following table|| provides the data for 7. gambiense to be compared with 
the distribution of 7. rhodesiense ranging from 12 to 39 in the last table. 


Microns. 
ont | | | | | 
T5416 ee | 18| 19 ie 2) | 21 | 22 23 | 24| 25 26 | 27 | 28 | 29| 80 | 31 | 82 | 33 | 34| 35 | 86 | 37 | 88 | 39 | Totals 
| | | | al | 
| | | | | | 
9 | 21. m6 9 114 aie 85 | 61 | 47 a ie Sila Opiellen | Aaa eet = ze - | 1000 
lie =) I 


The Sear adie are ran a ate of hosts. 


For the 28 classes we have, y?= 140°27 and P<-000,000,1. The chief point 
therefore is the complete divergence, not the resemblance of the two series. 


* R. S. Proc. Vol. 85, p. 238, 1912. § R. S. Proc. Vol. 84, B, p. 332. 
+ R. S. Proc. Vol. 86, B, p. 407. || R. S. Proc. Vol. 84, B, p. 330. 
+ R.S. Proc. Vol. 86, B, p. 302. 


KARL PEARSON 97 


Stephens and Fantham, who term their work a “biometric study,” speak of 
“the general resemblance between the curves representing the measurements of 
these three trypanosomes (7. gambiense, T. rhodesiense, T. brucet).’ They con- 
tinue: “We do not consider, however, that identity of measurement would 
necessarily imply identity of species. We still believe that the difference in 
internal morphology, namely the presence of the posterior nucleus, is sufficient to 
separate 7’. rhodesiense both from 7. gambiense and T. brucei*.’ As a matter of 
fact the “ biometric study ” of the data does not indicate identity in the measure- 
ments, but confirms the result of internal morphology by proclaiming wide 
differentiation +. 


(d) We can now compare 7. brucei and T. gambiense. Of these Sir David 
Bruce writes: “Whether these slight differences are fundamental or only acci- 
dental it is impossible at present to say, but enough has been written to show 
that Trypanosoma gambiense and Trypanosoma brucei approach each other very 
closely in size and shapet.” The biometric commentary on this is that for length 
of the two series yx? = 126°52, giving P< ‘000,000,1 and that as far as size is 
concerned the samples ditfer immeasurably, ie. far beyond the limits of the 
calculated tables of P. 


We should thus conclude, merely from the statistical evidence, for close same- 
ness in 7. brucer and T. rhodesiense but for marked divergence of both from 
T. gambiense. 


* R. S. Proc. Vol. 85, B, p. 233. 

+ In a later section of this memoir I show that Stephens and Fantham have been markedly biased 
in their judgment of even and odd units of measurement (p. 129 below), and that the recognition of 
this makes a wide difference in the goodness of fit of my resolution into components to their data for 
T. rhodesiense. It seems desirable therefore to inquire whether this bias affects the test of ‘‘sameness”’ 
of T. rhodesiense with T. gambiense, T. brucei, and the Human strains (b) and (c), see the Tables 
pp. 95—6. The data were accordingly classified into groups of two microns, starting with 12 and 13, 
14 and 15, etc., so as to get rid of the even bias as far as possible, and we find : 


Old Unit Ranges New Two Unit Ranges 


Strains compared 
| on x? Ps n x? 1p) 


| 


TL. rhodesiense and T’. gambiense | 28 140°27 <'000,000,1 14 118°73 | < :000,000,1 


T. rhodesiense and T. brucei ... | 28 46°83 019 | 14 25°76 ‘018 
T. vrhodesiense and Human 
strain (b) a ae reales 28 69°95 000,01 14 45°92 000,06 
T. rhodesiense and Human | 
strain (c) | 28 325°47 < ‘000,000, 01 14 253°37 | < -000,000,01 


The bias towards even numbers of Stephens and Fantham has thus not substantially influenced our 
results, which still show the relative likeness of 7’. rhodesiense and T. brucei, and the marked divergence 
of the former from 7. gambiense and the human strains. 

{ R.S. Proc. Vol. 84, B, p. 332, 


Biometrika x 13 


98 A Study of Trypanosome Strains 


(e) It seemed well worth while to investigate how far the two Nyasaland 
strains of Human Trypanosomes given in the table on p. 95 agree or differ. The 
first (b) of these strains from a native woman of Nyasaland may be compared with 
(c) a compound strain from four natives and a European. We find 


x? = 172°36 
giving P < :000,000,1. 


In other words, the two Nyasaland strains from human beings are indefinitely 
differentiated. I now compare the Mzimba (Donkey) strain* (a) with human 
strains (b) and (c), we find: 


for (a) and (b) x? = 22316 giving P certainly < :000,000,01 ; 
for (a) and (c) x? = 348°55 . < 000,000,01. 


Thus the trypanosome strain found in the donkey appears to be absolutely 
incomparable with that found in man in Nyasaland, just as the strain found 
in the donkey differed from that found in wild-game. 


(f) We may now turn to a memoirt by Sir David Bruce and others com- 
paring the Mvera cattle strain, the wild-game strain, and the wild Glossina 
morsitans strain. They give on p. 18 of that paper the graphs for 500 specimens 
of T. pecorum, the wild-game strain, and of the wild Glossina morsitans strain taken 
from a variety of hosts. The following are the frequencies: 


Microns. 
_— * aa 
Strain 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | 19 | Totals 
== |. a 
Mvera Cattle Strain eee Met bana a 15 | 64 | 101) 186) 114) 59 Seal 500 
Wild-Game Strain ... we | —| — 2) 34 | 85) 172) 119 | 63 | 292) 3 | — 500 
Wild G. morsitans Strain ...| 1 | 4 | 16 | 42 |129) 147/103) 42 | 15} 1 | — 500 


We compare first Mvera cattle strain with the wild-game strain and find for 
our 10 categories 
x7 = 34554, P= 000,243. 


This is a relatively low degree of divergence considering that P has been running 
into 1 in 10,000,000! But it means that if these two strains were samples of one 
and the same population, we should only expect two such divergent samples to 
occur 1 in 4000 trials. 


* This Mzimba strain of trypanosome is discussed in a paper headed: ‘Morphology of the various 
strains of Trypanosome causing Disease in Man in Nyasaland.—The Mzimba Strain’ (R. S. Proc. 
Vol. 87, B, p. 26); it is said to be of the Nagana type and is identified by Sir David Bruce and 
colleagues with 7’. brucei vel rhodesiense, the source of the human trypanosome disease. 

t+ R. S. Proc. Vol. 87, B, p. 4. 


KARL PEARSON 99 


Next we find for Mvera cattle strain and the wild Glossina morsitans strain, 
x? = 40°508, or P=-000,008, 


or only once in 125,000 trials would a pair of samples so divergent arise when 
testing the same material. 


Lastly, testing the resemblances of wild-game strain and wild G. morsitans 
strain, we find 
x? = 35°41, or P =°000,2, 
not such a gigantic divergency as we have found in many cases, but a difference 
so great that it only occurs once in 5000 trials requires explanation as divergency 
and cannot be used as an argument for “sameness.” 


It will thus be quite clear that as far as the measurements of length go, there 
is wide divergence to be accounted for between the trypanosomes found in the 
cattle, the wild-game and the tsetse fly, and that statistically this divergence is the 
remarkable feature. Yet the conclusion of Sir David Bruce and his colleagues, 
arguing very largely from the frequency distributions, is that “The Mvera cattle 
strain, the wild-game strain and the wild G. morsitans strain belong to the same 


species of trypanosome, 7. pecorum*.” 
d 


It will be seen that actual statistical analysis does not in any way confirm the 
bulk of the conclusions reached by Sir David Bruce and his collaborators. The 
strains may or may not be ultimately of like origin, but what is quite clear from 
the analysis is that, if we are to rely on the measurements, then it is the diver- 
gence, not the sameness of these strains, which should have been emphasised. 
No stronger evidence could be deduced of the danger of appeal to statistics 
when the statistics are not handled by the trained statistician. The mere appeal 
to the resemblance of frequency curves given in the form of percentages, often 
based on widely different totals, is an only too common error of medical investi- 
gations ; it is by no means confined to the Scientific Commission of the Royal 
Society, Nyasaland. But it has recently become so marked a feature of Series B 
of the Proceedings of the Royal Society, that a vigorous protest is really needful. 
Thus in the very last part issued (Vol. 87, B, p. 89) occurs a paper on “ The 
Trypanosomes causing Dourine.” In this paper there may be microscopic evidence 
to differentiate the strains A, B and C dealt with; on that I cannot express an 


* A further conclusion is also reached (Ibid. p. 26) ‘‘7'. pecorum, Nyasaland, is identical with the 
species found and described in Uganda.” Unfortunately the species found in Uganda is dealt with 
in a paper (R. S. Proc. Vol. 82, B, p. 468) which provides no frequency distributions, and does not tell 
us the total number on which the mean length—13°3 microns—is based. The mean value of the 
T. pecorum, Nyasaland is 13-954 (R. S. Proc. Vol. 87, B, p. 3) and the standard deviation is 1-393 in 
microns, thus the probable error of the mean is °67449 x 0623. Assuming the Uganda trypanosome to 
be the same strain and to have the same variability as the 7’. pecorum, Nyasaland, the difference of the 
means = ‘654, with a probable error of *67449 x /2 x 0623 =-67449 x -088, thus the deviation of the means 
is 7:73 times its standard deviation. A deviation so great would only occur about once in 4 x 10" trials, 
i.e., would be practically impossible if the two strains were identical. Here again it is excessive 
divergence not sameness which the statistics indicate. 


13—2 


100 A Study of Trypanosome Strains 


opinion. But on pp. 92—3 percentage frequency curves are drawn for the three 
strains, and the following remark is made : 


“A survey of the curves obtained by plotting out in percentages the various 
lengths of trypanosomes encountered in each of the three strains is of interest. 
It will be observed that in the case of rats the curves of each of the strains corre- 
spond fairly closely.” 

Now what do the authors mean by “fairly closely”? In their conclusions 
they identify B and C and differentiate A. Unfortunately they have not given 


their actual frequencies, and I have had to endeavour to reconstruct them from 
the percentage curves. There results for the rat-data: 


Microns. 

| i eel ier ae | | 
| 16|17|18| 19 | 20| 21 | 22| 23\ 24| 25 | 26 | 27] 28| 29 | 30| 31 | 82| 83 | 84| 85 | 36 | Totals 

| | | | | | | 
oo | | | | | cae | | | z 
| [oe es te | ae ea | | 
Berlin Strain A . | 1 | 1 {10] 9 |12/17|17)22|28]48 |47 |57/55|42 |39!37|/28/13| 81 6 | 3 | 500 
Frankfurt Strain B...]—|—| 1] 3 | 5) 1] 4/10] 20/29*|18* | 25 | 24 | 35* | 23} 15)18}15] 8 | 3 
East Prussian Strain C}—-| 1 | 4 3 | 6|12)15)22|24)27 | 28-|37|31])16 |10; 7) 5; 2);-—|— 
ee 7 . : | | | | | | 


We obtain the following results: 
Strains A and Bs) ¥7=3111, P= 0627, 
Strains A and C: y?=43°37, P= -0034, 
Strains B and @: 4°=72:72, P=<:000,001 


Thus to judge from rats only, Bb and C are far more divergent from each 
other than either is from A; in other words the strain A is intermediate between 
B and C and closer to 5, from which it is not immensely divergent; two such 
samples as A and B might, as far as the length distributions go, be drawn from 
common material once in 16 trials. 


Now of course no one suggests that a conclusion drawn from this rat-material 
is to replace one drawn from guinea-pig material, but the statistician cannot agree 
that for rats “the strains correspond very closely”; and he finds it illogical to place 
the evidence of the rat-data on one side and proceed to draw conclusions from the 
ocular inspection of the guinea-pig curves, without noticing that the conclusion is 
markedly opposed to the proper deduction from rat-data. Indeed while the guinea- 
pig-datat give a relatively high degree of relationship between B and C (P =:0157) 
it is not as high as the rats give between A and B (P=-0627); and while the 


* The values given by the percentage graphs in these cases are respectively 21, 17 and 34, and 
the total appears to be 247 and not 250 as stated. Hither 247 were used or the graph is in error. 
The three individuals were introduced in a way calculated not to increase divergence. 

+ The frequency distributions for the guinea-pigs have had to be reconstructed from the percentage 
curves, the necessary data not being published by the authors. 


KARL PEARSON 101 


relationships of A and B (P< -000,000,1) and A and C (P< 000,001) are very 
low, the origin of the second hump in the guinea-pig distribution for A requires 
much more analysis and the certainty by control experiments, that it always 
repeats itself, and is not the result of hitting a “ pocket.” 


¢ 


It seems to me that any statistical analysis by modern methods of the trypano- 
some data compels us to confess that either statistical methods must be discarded 
entirely in these trypanosome investigations, or they must be pushed to their 
logical conclusion, and used as the fundamental instrument of research which can 
guide our enquiries by inference and suggestion when, and when only, it is handled 
by the trained craftsman. Thus far the use made of statistical methods seems 
merely to have confused the issues, and brave would be the man who would venture 
to say after reading this section of our present paper that any two strains discussed 
by the commission are definitely “same” or certainly differentiated. 


(5) On the Probability that the Animal in which the Trypanosome vs cultivated 
makes essential Differences in the Distributions of Frequency. 


But the very method which casts apparent discredit on the results at present 
reached seems able to lead us to definite conclusions provided we start with it as 
the fundamental mode of investigation. Really very little inspection seems to indi- 
cate that not only the host but the period of infection materially influences the 
frequency distribution. These points have not been wholly disregarded by the in- 
vestigators in this field, but they have had no quantitative measure by which they 
could appreciate the relative influence of the various environmental factors. Nor 
indeed could the method be fully applied without experimental observations on 
trypanosomes of the same strain subjected to differential treatment. Knowing in 
such cases the quantitative divergence produced, we should be in a position to infer 
whether two strains from different sources were separate species or merely modified 
by differential environment. Until we have such quantitative measure no hypothesis 
of sameness or difference can flow from statistical treatment; nobody as yet knows 
how much to attribute to environment, how much to attribute to individuality 
of strain. 


In endeavouring to throw light on this matter we are, however, checked at 
the very start by the absence of effective material. In some cases the period of 
infectivity is not given; in others we are not always able to break up the total 
frequency by reference to the host, or to a single host. And even when we merely 
classify by one type of animal as host, we may have reduced our material to such 
small numbers that samples may be “same,” which on larger numbers would 
show the marked divergence due to the emphasis of smaller differences*. Some 
suggestive points can, however, be effectively dealt with and they are treated in 
the following paragraphs. 


* It may not be possible to differentiate Bavarian from Wiirtemberger on samples of 50 crania, 
although quite possible on samples of 400. 


102 A Study of Trypanosome Strains 
(a) I ask what difference is made when a strain is passed through various 
animals (goat, monkey, dog, rat) or through a single animal alone. Taking the 


wild-game strain discussed by Sir David Bruce and others*, we have: 


Microns. 


10 | 11| 12 | 18 | 14 | 15 | 16 | 17 | 28 | Totals 


Wild-Game Strain 

(from various ae 
Wild-Game Strain 

(from a single rat 510) 


Here we find y? = 65°37 and P < ‘000,000,1. In other words the distribution of 
lengths of the trypanosomes of the wild-game strain obtained from various animals 
differs so enormously from that obtained from a single rat that the two cannot be 
looked upon as samples of the same population. The moment this result is realised 
we appreciate that (1) it is impossible to compare two strains developed in a variety 
of animals unless we have previously tested on the same strain the equal valency 
of these animals, (11) a series of animals of even the same species may quite 
possibly give widely divergent results from those obtained for a single animal. 
Thus passing from a variety of animals in wild-game strain to a variety in wild 
G. morsitans strain makes less difference (P = ‘000,008)—although great enough— 
than passing from a variety of hosts to a single rat in the wild-game strain. 
This rule is not universal, but it illustrates the absolutely essential need for 
testing the effect of change of host before questioning the identity or non-identity 
of two strains. 


(b) I now turn to the Mvera cattle strain, and ask what differentiation is 
produced by the dog and goat as hosts. The data are very sparse and unless we 
get a high degree of resemblance may be worth little. They run+: 


* R. S. Proc. Vol. 87, B, pp. 6 and 8. 
+ R. S. Proc. Vol. 87, B, p. 3. I tested the relative interchangeability of goat and sheep in the case 
of T. caprae. The data are as follows: (R. S. Proc. Vol. 86, B, p. 280) 


Microns. 
eee ie | | Wi tiee pee a | 
T. caprae | 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 82 | Totals 
—— | | 
Goat .. |—|—] 8 | 7 | 11] 85 | 43 | 50 | 88 | 28) 27; 17) 5 | 1 | —} 260 
Sheep... | — | — J 1 | 10 | 12 | 29 | 39 | 31 | 28 | 20 5 | 3 id 1 180 | 


leading to y?=18-088 and P=-1133 or the resemblance is considerable although not so great as we find 
between goat and dog for the Mvera cattle strain. 


KARL PEARSON 103 


Microns. 
| > | ] 
| DON Le | 12" TS tp \\ 15 | 16 | 17 | Totals 
| 
fs | | | 
Mvera Cattle, Goat ... | 1 Wed ala 225 26°) 19 is.) i 100 
” ” Dog | 


a eon ot lala ag ZI 8, | =— | 100 


We have x? = 5°396 leading to P= "714, or in 71 pairs of samples out of 100 
from a homogeneous population, we should get more divergent results. It follows 
therefore that, as far as these small series of this strain go, goat and dog are 
interchangeable as _ hosts. 


Let us go a stage further and ask whether ox is interchangeable with goat and 
dog. The following is the frequency distribution for the trypanosomes through 
the ox: 


Microns. 
9 10 ae ei 13 || Lo |G NLT | is Totals | 
, a\2 _| 
| | 
Mvera Cattle, Ox ... | — | A Pis | 33 | 44 | 49|91| 7 | 1 | 180 | 
| 


Compared with the goat strain, this gives 
‘x? = 9559 and P =°3888, 
and compared with the dog strain 
xy? =9:461 and P= ‘3973. 


Thus in about two out of five trials from a same population we should get 
pairs of samples differing more than the dog and goat strains do from the ox 
strain. We conclude that while for practical purposes dog, goat and ox strains in 
the Mvera cattle trypanosomes are interchangeable, yet the dog and goat strain 
are nearly twice as much alike as the ox strain is to either. Lastly—although it 
is rather a rash proceeding—I compare rat with goat and dog. It is rash because 
only 40 trypanosomes through the rat were measured, and this is wholly inadequate 
for real determination. The frequencies for the lengths are: 


ae oe ee ee ae 
| | 
| 9 | LO tL | 12) 18 14 | 15 | 16 | 17 | Totals 
| | 
—| | are 
Mvera Cattle, Rat ... sas fe NN I | alee oy 40 
; »  Dogand Goat | 1 | 1 | 6 | 25 | 49 | 56 | 40) 21) 1 | 200 


We find y?=21'329 and P=-0064. The small series of rat trypanosomes 
probably accounts for no smaller value of P, but the odds of 155 to 1 are 
sufficient to show that rat series must not be mixed with series from the goat, 


104 A Study of Trypanosome Strains 


dog or ox. This confirms the view obtained for the wild-game strain, that a 
strain taken through the rat as host is incomparable with strains from other 
animals. 


(c) The totals considered for one species of host in (a) and (6) are rather 
small. Larger numbers are forthcoming for the so-called Mzimba strain of 
trypanosomes taken from a donkey at Mzimba. The frequencies are here*: 


| ” ” 


Microns. 
16 | 17 18/19 | 20 | 21 | 22| 23| 24 | 25 | 26 | 27 | 28 | 29 | 30 | 31 | 32 | Totals 
| ; | | } | | | | le 4 | | | 
Re cal as 
Mzimba Strain, pos | 3 17 | 56 69 | 67 | 47 | 27 | 22 | 10/12] 7 4| 4 | 4|2)1 | 360 
e | 2 i 41 91 79 56) 53 | 38/39 22/19/16/15 | 9 | 2/2/21) 500 


este 

We find y? = 25499 and P=-0619. Thus only about once in 16 trials should 
we get such a degree of divergence as the two samples present, drawing them from 
the same population. This is very far from such a divergence as we have noted 
in the rat and dog for the Mvera cattle strain, or in the case of rat against other _ 
animals in the wild-game strain, which was extremely large. The only expla- 
nations that occur to me here are: 


(i) In the case of the wild-game strain and the Mvera cattle strain a single 
rat seems to have provided all the trypanosomes, while in the case of the Mzimba 
strain two rats were used; this might lessen the influence of individuality. 


(ii) In the case of the Mvera cattle strain and the wild-game strain the 
trypanosomes were ultimately taken from a great number of individuals. In the 
Mvera cattle case we are told that 32°/, of the herd were affected, and we have 
some details of 16 head of cattle and 5 donkeys naturally infected+. In the wild- 
game case, the wild game affected were very numerous, covering cases of eland, 
reedbuck, waterbuck, bushbuck, oribi, koodoo, hartebeeste, buffalo and hyaena. 
Now can we start with the hypothesis that all the individual cattle and all the 
individual wild game were each bitten by a fly carrying the same strain of 
trypanosome? Have we any more right to suppose @ priort that one wild- 
game strain of trypanosome and one cattle strain of trypanosome exist, and ask 
whether these two are identical, than to ask whether the strains carried by hyaena 
and hartebeeste are the same? We have already (p. 93) seen that the strains 
from two hartebeeste are extremely divergent. What right have we @ priort 
to classify all wild-game trypanosomes together and call them a wild-game strain ? 
And if two antelopes, whether of the same or of different species, give widely 
different results, why are the trypanosomes of oxen of the same herd or donkeys 
and oxen from the same neighbourhood to be classed @ priort as of one species ? 


* R. S. Proc. Vol. 87, B, p. 31. 
+ R. S. Proc. Vol. 87, B, p. 15. 


KaruL PEARSON 105 


If we turn to the Mvera cattle, we find there were four sources of trypanosomes 
for the ox, two for the goat, and the same two for the dog—these two sources being 
two of the four cattle sources. There was only one source for the rat, but I have 
not discovered how far it was identical with one of those for ox or goat*. In the 
Mzimba donkey strain there was one source for dog and rat. In the wild-game 
strain there were, I make out, ecght sources of trypanosomes for the goat, four for 
the dog, and only one for the rat. 


Thus the individuality, which might be supposed to influence the result, 
because we are treating of trypanosomes in this case from a single rat, in the 
Mvera cattle case from a single rat, and in the Mzimba data from only two rats, 
may really arise from the fact that the rat strains in each case are derived from a 
single source, while the dog, goat and ox strains show a multiplicity of sources. 
The troublesome point is that the experimental part of the work has not been 
designed to answer what seem to me fundamental questions. We cannot directly 
inquire what difference the host makes because different hosts have rarely been 
treated with the strain from a unique source. We can say that dog and goat are 
interchangeable for the Mvera cattle strain, because both drew trypanosomes from 
the same two sources ; but we cannot determine whether the difference in the ox 
is due to difference of the host, or to the introduction of two more sources. Simi- 
larly the divergence between the trypanosomes from rat and from other animals for 
the wild-game strain may be due to using one rat and therefore one source, and not 
the many sources of the other animals, or it may really be due to the differentiation 
of the host. In the same way the difference between the two hartebeeste may be 
due to individuality in the same species, or to infection from different strains. 


(d) To some slight extent we may appreciate the effect of individuality by 
comparing the two rats 512 and 513 im the case of the single source, the Mzimba 
strain f. 


The frequencies are as follows : 
Microns. 


| | Neal ee lee | 
| Mzimba Strain | 16 | 17 | 18 | 19| 20 | 21 | 22 | 23 | 24 | 25 | 26 
| | | | 


| 
| 
| 
| 


7/1/1/2] 240 
2/1 He 260 | 


Rate? °... |—| 5 
(Rat 513. 2) 9 


{ 


The numbers are not as large as we should like; but they give 


y2=17'89, P ="3306. 
* R.S. Proc. Vol. 87, B, pp. 2 and 15. 
+ R. S. Proc. Vol. 87, B, pp. 6 and 8 compared with 5. Rat from p. 8. 
t R.S. Proc. Vol. 87, B, pp. 29 and 31. 
x 


Biometrika 14 


106 A Study of Trypanosome Strains 


Clearly then two samples as divergent as those found wonld occur on the 
average once in three trials. It follows that two individual rats are really inter- 
changeable and we note that the extent to which ox is interchangeable with dog 
or goat for the cattle strain is very much the degree in which two rats are inter- 
changeable. To judge from this single instance, individuality within the same 
species of host is not very important, and when we find two hartbeeste differing 
as those considered on p. 93, it seems much more likely, with the information we 
have at present got, that the hartebeeste were infected with different strains of 
trypanosome than that their individuality produced the enormous divergence 
noted. Again the sensible divergence between Mzimba strain in dog and rat on 
p. 104 is probably due to difference of host, but the enormous difference in the 
wild-game strain between a single rat and dog and goat on p. 103 is probably due 
to differences in the strains of trypanosomes in the various types of wild game 
dealt with. We may consider whether the dog and goat data for the wild-game 
strain differ sensibly. We have* 


Microns. 
i oe : : 
| | 11 | 12 | 18 | 14 | 15 | 16 | 17 | 18 | Totals 
+ : 
| Wild-Game Strain, Goat ... | 1 | 16] 37 | 73 | 38) 26| 8 | 1 | 200 | 
| 3s = 5 Dogs... | — | 12) 31))957 | 50 | 24 | 6. | ==s\eas@ | 
| eal a 


Here y?= 6:04 and P ='5378. Thus in more than half the trials we should 
obtain from homogeneous material pairs of samples more divergent than those for 
dog and goat. This confirms the view formerly expressed that as far as trypano- 
somes are concerned dog and goat are interchangeable. We cannot yet say that 
they are not interchangeable with the rat, as the mixture of strains in dog and 
goat and the uniqueness of strain in the rat may account for the marked 
divergence of the latter. Sir David Bruce and his colleagues do not appear to 
have noticed the wide divergence of the distribution of the rat from the dog and 
goat either as indicating the heterogeneity of the wild-game and the cattle strains 
of trypanosomes, or as suggesting such wide differentiation of strain by the host, that 
rat-material cannot be mixed with that from dog and goat. They do, however, 
remark of the wild-game strain: “In this the rat is not a suitable animal, since 
many strains of 7’. pecorwm have no effect on it}.” This suggests that 7. pecorum 
is not homogeneous and that the rat exercises a selective influence on its strains. 
The suggested rejection of the rat data seems, however, to be based upon the in- 
convenience of its non-infectivity, and not on what might turn out to be of great 
importance a selective influence on wild-game or cattle strains. It is not possible 
to test this selective power in the present mstance, as we do not actually know 
how heterogeneous either the cattle or wild-game material used really was. 


* R.S. Proc. Vol. 87, B, p. 7. 
+ R. S. Proc. Vol. 87, B, p. 7. 


Karu PEARSON » 107 


(e) If we turn to the 7. pecorum strain as actually found in the tsetse fly, we 
see that Sir David Bruce and his colleagues deal with these trypanosomes passed 
through a variety of animals, of which only goat and dog supply sufficient numbers 
for any even approximately accurate treatment. The data are as follows*: 


Microns. 


beg. zo") 27 es 14| 15 | 16 | 17 | 18 | Totals 


= rales ; | | 


Wild G. morsitans strain: Goat | 1 | 3 | 12 | 21 | 55 | 60 | 32 | 12) 4 | —} 200 


| 

. ¥¢ ” Doe |=") == 13) 144) 34. 41, | 40 | 19:1 9 | — | 60 
i | Ea 

Wild G. morsitans strain: Rat | — | 1 | — | 3) 22 | 28/19] 6 | 1 | — 80 


For goat and dog we find y?= 19°518, which give P=-0125. The resemblance 
is therefore far less than we have found for goat and dog in other strains, only 
once in 80 trials from homogeneous material would two samples of such divergent 
character arise. Before we comment on this it seems desirable to compare the 
very inadequate rat data. 


For rat and goat we have 

x? = 12201, P='1434. 
For rat and dog we have 

x?=11:370, P =-1245, 


Accordingly we see that for this material the rat strain (i) lies between the 
dog and goat strains, and (11) is definitely interchangeable with dog and with 
goat, while the dog and goat are much more divergent. Now the sparsity here of 
all the data must prevent any dogmatism; all we can reach is suggestion for 
further investigation. But the following points should be notedt. The trypano- 
somes through the goats were obtained from sva different goats, infected directly 
from the wild fly; the trypanosomes from the dogs were obtained from only four 
different sources, namely from a monkey directly infected by the wild fly, from a 
dog directly infected, and from two goats (89 and 125), the former only of which 
is identical with one of the former six goat sources. Lastly, the rats were infected 
from one dog alone, upon which the tsetse flies had directly fed. This dog is not 
identical with one of the dog sources. Now unless we assume that all the strains 
of the trypanosome found in the tsetse fly are identical—which is certainly not in 
accordance with the differences found in the strains of wild game from the “ fly- 
country ’—it is by no means certain that the trypanosomes obtained from wild 
G. morsitans, through goat, dog and rat as above noted came from anything like 
the same sources. Further, the closer resemblance between rat and dog strains 


* R. S. Proc. Vol. 87, B, p. 11. 


+ R. S. Proc, Vol. 87, B, pp: 10, 11, and 19 to 22, 
14—2 


108 A Study of Trypanosome Strains 


may simply be the result of the rat strain having been developed in the dog as 
host. The divergence between the dog and goat strain may again be solely due to 
the greater variety of sources in the goat. The data from the wild G. morsitans 
experiments seem to indicate that the observed divergences between the strain 
from rat and the strain from goat or dog may not be due to difference of host; 
but to difference of source from which the material was drawn, and to difference of 
treatment of the individual stock of trypanosomes, e.g. the number of hosts, ete., 
through which it has passed. 


It seems absolutely certain that at the present time most light would be 
thrown on the conditions for asserting sameness or diversity of strains, by well 
devised experiments on strains from single sources passed through different species 
of hosts in different manners, in order to determine the exact measure of divergence 
produced by host and by treatment, and ultimately to devise a standard treatment 
for all strains which we desire to compare. 


The exact nature not only of host, but of standard treatment is most vital. We 
can demonstrate the influence of treatment at once by considering the “ percentages 
of posterior nuclear forms among short and stumpy forms” recorded by Sir David 
Bruce and his colleagues for the wild-game strain*. All the trypanosomes were 
from rats, and although the date of infection of the rat is, I think, not stated, the 
dates of first extraction will be after much the same interval, and we can therefore 
classify by date from first extraction. We find the following table: 


Wild-Game Strains. 


Percentage of Posterior-Nuclear Forms among 
Short and Stumpy Forms. 


From first Extraction 21°/, and under 22°/, and over | Totals | 
| 


6 days and under 18 6 24 
| 7 days and over 6 18 24 
| Totals 48 


Using Sheppard’s formula for the four-fold table, we have for tetrachoric r 
Oe 


or, the correlation between this character of the trypanosome and the time 
after infection of extraction is very considerable. It will be obvious that in a 
standardised treatment this time of extraction will play a most important part. 
But it again is not independent of the species of trypanosome, for if we take the 
wild Glossina morsitans strainst, we find : 


* R.S. Proc. Vol. 86, pp. 396—404, Tables III, VI, IX, XII and XV. 

+ R. S. Proc. Vol. 86, B, pp. 410—418, Tables III, VI, IX, XII and XV. I have added one percentage 
by random selection from the complete table by lot in order to give 60 cases, and save labour in 
fractionising. 


Kart PEARSON 109 


Percentage of Posterior-Nuclear Forms among 
Short and Stumpy Forms. 


From first Extraction 7 °/, and under 8 °/, and over Totals 


6 days and under 18 30, | 
12 


7 days and over 


Totals 30 


leading to r = — ‘309. 

In other words using tsetse fly strains and not wild-game strains, but the same 
host, we find that now the correlation is negative or the longer the infection the 
smaller the percentage. Actually the five G. morsitans strains show remarkably 
irregular results compared with the results for the wild-game strains; the ex- 
tractions were spread over much the same period, 13 to 14 days on the average, 
but were somewhat more numerous for the G. morsitans. Thus even the same 
method of extraction may give widely varying results according to the nature 
of the strain producing the infection, although the host be the same. 


To the statistician who examines the frequency distributions provided by 
Sir David Bruce and his colleagues for both wild-game strains and Glossina 
morsitans strains, there can hardly remain a doubt about the heterogeneity of 
the material in each case. We have already demonstrated this statistically for 
the wild-game strains. These strains not only differ by immense differences 
inter se, but intra se they are clearly heterogeneous. Whether this heterogeneity 
is due to the mixture of separate strains, to dimorphism within the strain, or to 
the combination of material drawn from the rat at various stages of infection, it is 
not possible on the material at present available to determine finally. The same 
remarks apply with even greater certitude to the wild G. morsitans strains than to 
the wild-game strains. But we shall return to this point in the last section of this 
paper. We have already noted that Sir David Bruce and his colleagues identify— 
against the weight of the statistical evidence—the Mvera cattle strain, the wild-game 
strain and the wild G. morsitans strain as belonging to the same species 7’. pecorum*. 
They had previously identified other strains in wild game, G. morsitans and human 
beings} with 7. rhodesiense which they elsewhere describe as vel brucet{. This is 
again, I hold, against the weight of statistical evidence. But it is not clear from 
the memoirs themselves what is the exact process by which an individual fly, an 
individual human being, or the blood from a specimen of wild game is credited 
with carrying a homogeneous strain. The sizes are so different in the cases of 
T. pecorum and T. simiae that there may be no difficulty in distinction, but the 
range is so great and to the statistician the material seems so heterogeneous in the 
ease of T. brucei vel rhodesiense that, perhaps, a fuller description by the authors 

* R. S. Proc. Vol. 87, B, p. 26. 


+ R. S. Proc. Vol. 86, B, p. 42. 
+ R. 8. Proc. Vol. 86, B, p. 426. 


110 A Study of Trypanosome Strains 


of the process of differentiation would aid him. This is of especial importance 
if it should turn out, as I suspect, that the trypanosomes classed as T. brucei are 
either dimorphic, or belong to two different species. 


In another paper* we find the trypanosomes from G. morsitans, on the basis of 
their infective powers on monkey, goat and dog, resolved into 7. brucei vel rhode- 
stense, T. pecorum, T. simiae and T. caprae. But it is clear that the differentiation 
was not done solely by infectivity, or there would have been no means of dis- 
tinguishing 7. bruce: and T. pecorwm which attack all three—monkey, dog and 
goat. The question arises, whether 7. pecorum, T. simiae and T. caprae being 
readily identified by microscopic examination or size, the remainder was classed as 
T. brucei, in which case the question of the heterogeneity of this group, which 
appears to attack all animals, is rather supported than otherwise by this paper. 


Frequencies of the Various Strains for Length. 
Length in Microns. 


| -.| | | | | | 
Strain 9 |10)11| 12 | 28 °\ Th \eioy| 6 ety 78 | 19 20 21 | 22 | 28 | 24 | 26 | 26 
| Ae aaa . 
T. pecorum 2 | 6 | 42/193 | 452] 618/ 453/178] 51] 5 mee 
T. simiae —{|—|—| — | — | 7) 28) 76) 93) 126) “92)/0 47i\" 221) eG io) ae ee 
T. caprae —;/—-—|;-|]—- 1}— | 3] 8) 28) 49) 79.) 95) 80 
| 
(i) 7. rhodesiense —i—|—]1 3 10} 19] 29] 35! 67 | 54) 92 | 51| 74}> 56! 68 59| 85 
Gal Ze bruce, 62-9) — ao 8| 14] 17} 40] 63) 55 66 63)| 75) 87) 93] 80] 82 
(ii) 7. gambiense... | —|—|—]— |} 1 | — 9| 21| 56| 79|114/122 110] 85] 85] 61] 47] 49 
(iv) Mzimba Strain | 8| 27) 791175 189 139/109} 72) 66) 36) 32 
(v) G. morsitans ... | = 7| 3L| 148 | 230 | 326 252 237 | 184) 143 | 115 | 130 | 110 
(vi) Wild Game ... | 1 8} 53/118 | 252 381 | 348 | 285 | 200 | 162 | 149 | 135 
(vii) Human Strain | — | —|—}— | — 1} 10/ 41/154) 325 | 494 | 528 577 | 512 | 525 | 511 | 464 | 425 
| (viii) Chituluka... | |= | - 1 8} 48} 81, 78) 71) 44, 46] 56) 53) 98) 120 
| | | | | 
Length in Microns—(continued). 
| | jie | | | | 
Strain 27 | 28 | 29 | 30 | 31 | 32 | 33 84 | 85 | 36 | 37 | 38 | 39 | Totals} Source 
- | ee ae | aes 
T. pecorum sti - | | |—|—| 2000 | R. S. Proc. 87, B, p. 13 
7. simiae | esa ea 500 | Ibid. 85, B, p. 477 
T. caprae 68| 57; 24) 9) 2) 2)/—|—|} - 500 | Zbid. 86, B, p. 278 
(i) TZ. rhodesiense | 61| 72| 50] 52| 28| 13/13|-5| 1| 1|--|—| 1] 1000 | Jéid. 85, B, p. 227 
(i) 7. brucei 72| 50) 38| 27) 26) 18)11} 4) 4)/—|—j| 2 |—] 1000 | Jbed. 84, B, p. 331 
Gii) ZT. gambiense 47| 44) 31] 20] 11] 4] 4 - -—|—-| 1000 | Ldzd. 84, B, p. 330 
(iv) Mzimba Strain | 24] 22) 16) 7) 4] 4)—| | —|—| 1000 | Zbcd. 87, B, p. 31 
(v) G.morsitans ... |127|133/113} 96; 54; 44/11) 7; 2;—|—|—]/— | 2500 Lbid. 86, B, p. 419 
(vi) Wild Game ... | 125/110] 62} 55} 33] 12] 7/ 3} 1|/—|--|—|—| 2500 | Jbid. 86, B, p. 405 
(vii) Human Strain | 372] 347 307 | 198 | 167 | 123 | 77 | 36} 12/11} 2 | 1 |-—| 6220 | Zoid. 86, B, p. 330 
(vill) Chituluka 111 | 128/138} 99/117} 91/63/27/11| 9) 1 | 1 |—J| 1500 | Zbed. 86, B, p. 291 


* R.S. Proc. Vol. 86, B, p. 422. 


KARL PEARSON 111 


At any rate the exact method of differentiation adopted would be of interest 
to the statistician. The result of the paper is that the four species of trypanosomes 
occur in quite comparable permilles of tsetse flies caught in the sleeping sickness 
area of Nyasaland, and there is no evidence to show that they or other strains also 
may not occur side by side in the same fly or in the same specimen of wild game. 
Further, these compound strains would then appear in different proportions in the 
host. Some such hypothesis seems very needful to account for the extreme 
heterogeneity of the wild game, wild G. morsitans, and human strains as recorded 
by Sir David Bruce and his colleagues. The following table gives a comparison of 
what appear to be homogeneous strains—T. pecorum, T. simiae and T. caprae— 
with what appear statistically to be heterogeneous strains, ie. 7. brucer, 
T. rhodesiense, T. gambiense, the Mzimba strain, the wild-game and wild G. 
morsitans strains of human type, and the human strains themselves. The table 


Means, Standard Deviations and Coefficients of Variation of eleven Strains 
of Trypanosomes. 


Seri M Standard Coefficient 
Bs ean Deviation of Variation 
T. pecorum 13°992 + :019 1°2816 +014 9°16 + ‘099 
T. simiae 17°870 + 050 1°6558 + ‘035 9°27 +°199 
T. caprae 25508 + :063 2°1011 +045 8°58 +184 
() T. rhodesiense ... 23°577 +°100 4°6764+ 071 19°83 +°311 
Qi) ZT. brucei 23529 + 094 4°3938 + ‘066 18°67 +°291 
Qui) Z. gambiense 22°113 + ‘081 3°7867 + ‘057 17°12 + °266 
(iv) Mzimba Strain... 217413 +:063 2°9586 + 045 13°82 + 212 
(v) G. morsitans 22°695 + 058 4°3002 + 041 18°95 + 187 
(vi) Wild Game se 22622 + 047 3°4541 + 033 15°27 +:'174 
(vii) Human Strain ... 23°796 + °035 4°1262 + 025 17°34+°108 
(viii) Chituluka 26°172 + ‘084 4°8414+.060 18°50 + °235 


above, gives the means, standard deviations and coefficients of variation of these 
strains. It will be seen that the first three are of a very different character to the 
last five. The variation of the latter is about double that of the admittedly pure 
strains, and throughout the whole course of our further work this possibility of 
heterogeneity, and the differential selection of the components by the host must 
be borne carefully in mind. Great divergences do not discourage the use of 
biometric methods, and we get occasionally identities of strains which are quite 
beyond the limits of chance coincidence and which point to definite possibilities if 
only host, environment, and treatment are once effectively standardised. I propose 
to try to throw some light on these points in the remaining sections of this paper. 


(6) On the Probability that Strains are alike after allowance for the Host. 


(a) Luckily in certain cases the treatment has been more or less alike. Thus 
in the wild Glossina morsitans strain, the tsetse flies brought to the Laboratory 


112 A Study of Trypanosome Strains 


from the “ fly-country” were in one strain (I) fed on a monkey and in the case of 
four other strains (II to IV) fed on dogs. From these animals thus infected others 
were inoculated, but in each case only the trypanosomes from a single rat were 
used for purposes of measurement and comparison. ‘The following table gives the 
frequency distributions of the five strains, and chiefly on the basis of these 
distributions, Sir David Bruce and his colleagues conclude that: 

“The five wild Glossina morsitans strains resemble each other closely, and all 


belong to the same species of trypanosome.” (p. 421.) 


Wild G. morsitans Strains*. 


Microns. 


Strain I 


|] 


Se Oe 


Investigating the statistical measure of resemblance 


230 | 326 | 252 | 237 
| | 


| 25 


143115 130 


7 


the following series of results : 


Strains I and IT: x? = 81°88, P < 000,000,1, 
Strains I and III: Va aLoosil, P < :000,000,01, 
Strains I and IV: ye OOS: P < :000,000,1, 
Strains I and V: 2— 115°77, P < :000,000,1, 
Strains II and III: x? = 32812, P < :000,000,01, 
Strains II and IV: x? = 184°88, P< :000,000,01, 
Strains II and V: x? = 208:79, P < :000,000,01, 
Strains III and IV: x? = 122°79, P < :000,000,1, 
Strains III and V: x? = 147-20, P < :000,000,1, 
Strains IV and V: x? = 23°90, P =:2470. 


in the usual way we have 


Statistically therefore there is not the faintest resemblance whatever between 
any pair of these strains except the IV and V. These strains are for practical 
purposes interchangeable. In one out of every four trials two pairs of samples of 
500 from the same trypanosome population would give results more divergent than 
those observed. But what is the source of this resemblance? Why are these two 
strains alike and all the others widely divergent? There is nothing whatever in 
the paper to account for this agreement, and it is the more remarkable because 
Strains IV and V are to the statistician the most compound looking of all the 
strains. But some uniformity of origin or treatment has caused the two com- 
ponents to appear in like proportions, and at the back of this resemblance there is 
some vital point, if we could follow it up. Were the two dogs bitten by the same 

* RS, Proc. Vol. 86, B, p. 409 et seq. 


fly, or Rats 658 and 660 really inoculated from the same dog ? 


Kart PEARSON 


Clearly 


113 


there is a 


point here which ought to be cleared up, for otherwise the statistician could only 
conclude that the wild G. morsitans strains are widely divergent, and that their 
compound nature suggests that the tsetse fly carries various types of trypanosomes 


and these in varying proportions, 


(b) I now turn to the five human strains dealt with by Sir David Bruce and 


his colleagues. 
animals. 


Human Strains. 


A: Compounds from Various Animals*. 


Let us first consider the human strains compounded from various 
The following table gives the length distributions : 


Microns. 
| 18 | 19 | 20 | 21 | 22 | 23 | 24 | 25 | 26 
Strain I, Mkanyanga ... 4] 19 | 42 | 63! 81) 75) 91) 65) 66} 93] 91/107 
” ) 2 2 | 12 | 55 | 108] 159 210 | 188 | 215 | 177) 138) 83 
» II], Chituluka 1 8 | 48 | 81 | 78| 71] 44] 46] 56) 53) 98] 120] 
» LV, Chipochola ... 2] 4 | 32 | 68 |110/101/109|106) 95° 95] 74] 64| 
» V, Chibibi 1 8 | 20 | 58 | 117(122|123,107| 93) 98] 63} 51 
| 
— 
Sum 41 | 154 | 325 | 494 [oes 577 | 512 | 525 | 511 | 464 | 425 
Human Strains. A: Compounds from Various Animals—(continued). 
Microns. 
38 | Totals 
Strain. I, Mkanyanga . 1220 
ee al B 1500 
» Ill, Chituluka ... 1500 
IV, Chipochola ... 1000 
V, Chibibi 1000 


6220 


We may conipare the strains precisely as in the case of the wild G. morsitans 


We find: 


Strains I and II: 
Strains I and III: 
Strains I and IV: 
Strains I and V: 


strains. 


Strains II and III: 


Strains II and IV: 
Strains II and V: 


Strains III and IV: 


Strains III and V: 
Strains lV and V: 


x? = 408°50, 
x? = 204-99, 
x? = 180°63, 
x” = 20540, 
x” = 923°62, 
x foi 0k, 

veh, 00, 

x? = 53132, 
x? = 563°82, 
x? = 16°81, 


* R. S. Proc. Vol. 86, B, pp. 287, 291, 295, and 297. 


B, p. 423. 
Biometrika x 


P =< -000,000,01, 
P =< :000,000,01, 
P = < :000,000,01, 
P =<:000,000,01, 
P = <:000,000,001, 
P =< -000,000,5, 
P =< 000,000,5, 
P = <:000,000,01, 
P = <:000,000,01, 
P =7733. 


For Strain I see R. S. Proc. Vol. 85, 


15 


114 A Study of Trypanosome Strains 

Again we have the remarkable result that all the human strains are statis- 
tically divergent beyond any possible comparison, except those of Chipochola and 
Chibibi which show a high degree of correspondence. Now is this result the 
outcome of treatment? We note the following diversity of hosts : 


Strain I. Strain II. Strain III. Strain IV, Strain V. 
Gross | Percentage | Gross | Percentage | Gross Percentage | Gross | Percentage | Gross | Percentage 
Men 60 4°9 — 0-0 - 0:0 — 0 — 0:0 
Monkey ... 100 8:2 160 10°7 160 1KO}27/ 160 16°0 160 16:0 
Goat 20 16 60 4:0 80 5°3 80 ‘0 80 8:0 
Sheep 60 4°9 20 1:3 1 0-0 ‘0 — 0°0 
Dog ss 260 21°3 260 17°3 260 17°3 260 26°0 260 26°0 
Guinea Pig | 120 9°8 _ oOo | — 0-0 — ‘0 — 0-0 > 
Rat 600 49°2 1000 66°7 1000 66°7 500 50°0 500 50°0 
Totals 1220 — 1500 — 1500 — 1000 — 1000 — 


Now it will be clear at once that the percentages of trypanosomes drawn from 
various types of host are identical only in the case of Strains IV and V, which we 
have found in close accordance. But there is not great divergence in source 
between Strains II and III although Strain I shows fairly wide differences. We 
find, however, that II and III are statistically very unlike, the next closest 
resemblances, although very slight, being between II and IV and V. It would 
not seem therefore that the degree of similarity is wholly determined by similarity 
of hosts. I have accordingly reinvestigated the five human strains by taking rats 
only. But, of course, even then it is of vital importance to be certain that the 
process of transfer from man to rat was the same in all five cases, and of this no 
evidence is provided. 


Human Strains. B: From Rat only*. 


Microns. 

15 | 16 | 17 | 28 | 19 | 20 | 22 | 22 | 29 | 24 | 25 | 26 | 27 

| 
Strain I, Mkanyanga =e lie 1 | 21 | 40 | 52 | 49 | 80 | 31 | 36 | 33 | 48] 52 
» LL, 5, Rat 728 .~f—|—| 2] 4] 15 | 30] 57 | 72 | 85 | 72 | 59 | 44 | 26 
» I, E, Rat 796... --... | — |— | 2 24 | 30.) 42 |60))) 61 "87 | 78ul soulmoralnoo 
» III, Chituluka, Rat 952 | 1 | 3 | 21 | 27 | 23] 15] 10] 15 | 19 | 21 | 34 | 44 | 36 
3 ILI, Chituluka, Rat 953 — ih 17 | 26 | 20 | 19 | 15 | 14) 26 | 18 | 33 | 40 | 34 
,» IV, Chipochola, Rat 1337] — | — | 4| 6] 16| 29 | 53] 61 | 59 | 69 | 56 | 51 | 36 
.) V, Chibibi, Rat 1660... | —- | -— | — | 4 {17 | 29 | 46 32 | 69 | 73 | 52 | 40 | 31 
Sum Ne Ete via 1 | 5 47 | 112! 161 | 216 | 290 | 316 | 376 | 362 | 322 | 294 | 235 


* R. S. Proc. Vol. 86, B, pp. 288, 289, 292, 293, 295, and 298. For Strain I see R. S. Proc. 
Vol. 85, B, p. 423. 


Karu PEARSON 115 


Human Strains. B: From Rat only—(continued). 


Microns. 
: Totals 
Strain I, Mkanyanga 600 
eT. E). Rat 728 500 
55 II, E, Rat 726... dis 500 
» II], Chituluka, Rat 952 500 
» III, Chituluka, Rat 953 500 
» LV, Chipochola, Rat 1337 500 
i V, Chibibi, Rat 1660... 500 
Sum... male ... [219 | 210] 134| 108} 88 | 57 | 28 | y) S| dl 1 } 3600 


This table with its two pairs of rats inoculated from the same strains is 
peculiarly instructive. We can compare II, Rat 726, with I, Rat 728. 


We find: x? = 36195, giving P =-0048. 


This is far from the high degree of divergence we have found between the com- 
pound human strains, but it is not satisfactory as a measure of the agreement of 
the same strain in two hosts of the same species. 


Applying the same test to the two Rats 952 and 9538 of Strain III we have: 
Vv? =14715, giving P="9038. 


This is, of course, quite satisfactory. We should not hesitate to assert identity 
of strains and of treatment in the case of the trypanosomes from these two rats. 
The statistician will feel fairly confident that there is a factor of divergence 
between the trypanosomes of the two rats in Strain IT, which does not occur in 
the two rats of Strain III. He will be almost certain that the strain was not 
conveyed through the same steps or at the same stage of the disease to the rats in 
Strain II. Unfortunately dates and processes are not discussed. Sir David Bruce 
and his colleagues say that it is remarkable how much alike these distributions for 
Rats 726 and 728 are, and again for the distributions for Rats 952 and 953 that 
they also closely resemble each other. “It is curious and striking that the same 
strain of trypanosome growing in two different animals should show this remarkable 
similarity*.” The interesting point is that the statistician would agree with the 
remarkable similarity in the latter case, but the divergence not the remarkable 
resemblance in the first case would force him to seek for some explanation in 
treatment. It will, I think, be clear from these illustrations that a strain of 
trypanosomes, even if obviously compound, can be taken from a single source and 
after inoculation into two different individuals of the same species be identified 
as same; but to insure this result on every repetition the greatest caution will 
have to be exercised as to identity of process and treatment. 


* R.S. Proc. Vol. 86, B, pp. 289 and 293. 


116 A Study of Trypanosome Strains 


There are still further results of importance to be ascertained, however, from 
our table of human strains. Let us compare Strains IV and V, which we found 
resembled each other closely even for compounded hosts. We now reach 


x? = 14085 and P=:5229. 


Or, the probability that these two strains are identical has been reduced by 
selecting out the rat data only. But the result is still so high that no one would 
hesitate to assert that Chipochola and Chibibi were suffering from a disease due 
to the same strain of trypanosome. The correspondence is so close that we have 
combined Strains III and V for all other comparisons. In the case of Strain ITI, 
we have added together the results for Rats 952 and 953. Such addition is less 
reasonable for Rats 726 and 728, but without doing this, it is impossible to decide 
which rat is to represent the E strain. I have then made the following com- 
parisons : 
Strains IV and V with III: y*?=525-67, P <:000,000,01. 

There is accordingly no similarity at all between the Chituluka strain and that 

common to Chipochola and Chibibi. 


Strains IV and V with IT: y? = 64°70, P < ‘000,001. 


Thus the strain from the European E from Portuguese East Africa diverges 
from the Nyasaland strain widely, but not as widely as that of Chituluka does 
from those of Chipochola and Chibibi. 


Strain I with Strain HI: y? = 12613, P <:000,000,1, 
Strain I with IV and V:  y? = 21782, P < :000,000,01. 


Thus the trypanosomes from Mkanyanga are widely divergent from those of 
the three other Nyasaland cases. Nor are they any closer to the European E: 


Strain I with Strain Il: yx? = 331°37, P <:000,000,01. 


Thus with the exception of the Chipochola and Chibibi strains, the trypanosome 
distributions from human sources differ widely. Nor is this to be wondered at, if 
the human beings owe their trypanosomes to Glossina morsitans, for in that case 
we should expect the human strains to be as diverse as we have found those from 
the tsetse fly itself. It would remain to explain the close similarity of the 
Chipochola and Chibibi cases. It would be interesting to know the history of 
these cases with regard to locality and to the possibility of a unique source 
of infection. 


(c) In the case last dealt with, namely that of Chipochola and Chibibi, we 
have the remarkable feature that the strains although significantly identical, 
whether treated in the rat alone or in compounded distributions from various hosts, 
resemble each other somewhat less closely in the single host series. This is not 
generally the rule. Some of the big divergencies we have already noticed become 
far less appreciable, nay, even become resemblances when we confine our attention 
to one species of host. The chief misfortune which then too often arises is the 


Kart PEARSON 117 


paucity of the total numbers that we have at our disposal. I will consider, 
however, from this aspect the relations of the three strains wild G. morsitans, 
wild game, and Mvera cattle. 

I compare first the lengths of 200 trypanosomes from wild G. morsitans and 
wild-game strains. These yield for the host, goat* : 


Microns. 

| Satie 
From Goat 9 | 10 | TA) 12. | 13 | 1h | 15 | 16 | 17 | 18 | 19 Totals | 
é | | Ps aa | 

Wild G. morsitans Strain | 1 | 3 | 12 | 21) 55 | 60 | 32 | 12) 4 | —| — | 200 
Wild-Game Strain Hae | —|}— | Wk | 37 | 73 | 38 | 26) 8 2 200 | 

| 
giving : x? = 26782 and P=-0015. 


To further test this, I take the same two strains in the dog as host+: 


Microns. 


| ere ee | 
From Dog D0 | IED RED HES NGI A GESY IRE TRS | is} | Totals 

i onl | fe 

| eas | ee ee | 
Wild G. morsttans Strain 5) 3 . 34 41 | 40/19) 9 | — 160 
Wild-Game Strain ae al a 31 | 57 | 50 | 24 | 6 | —| 180 

| 
| \ | | 
Here v= (045 and P= 3171. 


The value we had previously found for a mixture of all strains was P = -0002. 
Thus the two strains may be considered as identical when we deal with the 
trypanosomes from the dog, as showing considerable divergence when we take the 
goat, and as showing marked divergence when we take a great variety of hosts. 
The weight of evidence in favour of a standardised treatment thus becomes very 
great. 


Let us look at precisely the same material for the wild-game strain and for 
the Mvera cattle strain, first for the goat and then for the dog as host+. The 
grave difficulty is the paucity of measurements thus differentiated. 


Microns. 
i =a Eee aaa | | : ie yi eel | lo | 
| From Goat 9 | 10 | 11 | 12 | 13 | 14 | 15 | 16 | 17 | 18 | Totals | 
| : SS | : | 
Wild-Game Strain ...|— | — | 1 | 16 | 37 | 73 | 38] 26| 8 | 1 | 900 

| Mvera Cattle Strain ... | 1 1 OF la 225260) TOE sa 1} ==) 100 
| ed | 
This gives x’ = 14670, leading to P =-'1013. 

* R.S. Proc. Vol. 87, B, pp. 6 and 11. 

+ R. S. Proc. Vol. 87, B, pp. 6 and 11. 


by by 


. S. Proc. Vol. 87, B, pp. 3 and 5 


+ 
+ 


118 A Study of Trypanosome Strains 


Microns. 
| From Dog | 11 | 12 | 13 | 14 | 15 | 16 | 17 | Totals 
eee | ps ee Me | 
| : | | 
| Wild-Game Strain soe fe IQ ASI D7 de bOM24alaG | 180 
Mvera Cattle Strain | || Lia O eel 8 | — 100 
| 
| 


I 


This leads to x? = 15992, P="0138; 


Previously (p. 98) on the total series of different hosts we had found 
P=-000,243, Thus by referring our material to individual hosts, we have reduced 
the degree of divergency between the wild-game and Mvera cattle strains, but it 
would be still hazardous to state that these strains are identical. 


Lastly, we turn to the Mvera cattle strain and the wild G. morsitans strain 
dealing with dog and goat as hosts separately * : 


Microns. 


From Goat ako) |) ib ae 1h 15 | 16 | 17 | Totals 


| | 


Wild G. morsitans Strain ... | 1 Sale | 21 | 55 |.60 32 | 12 | 4 200 
Mvyera Cattle Strain eeu Cal 1 3] 14 22 | 26 | 19°) 2372 100 
| | 


This gives x= 1-968, P = °4368. 


And again : 


Microns. 


| | 

From Dog 9 | 10| 21 | 12 | 13 | 24 | 15 | 26 | 17 Bas 
| 

ee eee ae eer 

5 5 | 
Wild G. morsitans Strain... | — | -— | ela: | 34 | 41 | 40 | 19 | 9 160 
Mvera Cattle Strain | —|—| 3/11] 27) 30} 21-4 8 |= 19 ico 

| | | | | | | | | 

resulting in x? = 11:120, P=:0852. 


The Mvera cattle strain and the Glossina morsttans strain had for all hosts a 
divergence measured by P= 000,008. Thus the great bulk of this divergence 
is due to multiplicity of hosts f. 


To sum up the results obtained for 7. pecorum in Mvera cattle, wild G. morsitans 
and wild-game strains, the identification of these strains was quite illegitimate on 
the basis of the compound host frequencies. It is reasonable on the basis of 

* R. S. Proc. Vol. 87, B, pp. 3, 10—11. 


+ It is worthy of note that in comparisons with the cattle strain the goat appears to give closer 
results than the dog, but the dog appears the better in the comparison of the G. morsitans and wild- 


game strains, 


Kart PEARSON 119 


trypanosomes taken from a single species of host. But how far the resemblance 
in these cases is produced by a selective influence of the host and not necessarily 
by an identity of all the members of the strain before transference to the host is 
not demonstrated. 


On the other hand while divergence due to host will account for the divergences 
which are so notable in 7. pecorum, it will not account for the divergences in the 
human strains; these are startlingly conspicuous even if we confine our attention 
to a single species of host. Precisely the same remarks apply to the trypanosomes 
similar to those causing disease in human beings found in wild game and in the 
tsetse fly itself. There must be another source for these divergences. 


(7) Discussion of the Heterogeneity which is statistically demonstrable in the 
bulk of the Trypanosome Measurements. 


The reader who has attentively followed the course of the argument in the 
previous sections will be prepared for the next step in this memoir, the attempt to 
account for the large divergences between strains of trypanosomes in individuals 
of the same species by the heterogeneity of those strains. My suggestion is that 
the strain in one fly differs from that in another because the components do not 
appear in the same proportion, the strain in one specimen of wild game from that 
in another, or in one man from that in another because they have been bitten by 
a fly containing the components in unlike proportions. The host does make some 
difference, either by nutrition or selection of trypanosomes, but it is a minor differ- 
ence. Thus consider what we may probably hold to be pure strains and observe 
the average differences in length found by Sir David Bruce and his colleagues: 


Microns. 
T. pecorum 
T. simiae* T. caprac | 
Mvera Cattle + Wild G. morsitans § 

Goat 17°3 | Waterbuck 26°8 | Donkey 13°5 | Goat 13°5 
Monkey Loa Ox 25°7 | Ox 14:2 | Monkey 13°6 

— Goat 25°3 | Goat 13°8 | Dog 14:2 

— Sheep 25°6 | Dog 13°8 | Guinea Pig 14°6 

— | — Rat 14°8 | Rat 14:0 
Max. Difference 0°8 Max. Difference 1°5 | Max. Difference 1°3 | Max. Difference 1:1 | 


We may thus anticipate that in a pure strain the change of host would hardly 
make a difference of more than 2 microns in the average length. We must 


* R. 8S. Proc. Vol. 85, B, p. 479. : t+ R. S. Proc. Vol. 87, B, p. 3. 
+ R. S. Proc. Vol. 86, B, p. 279. § R. 8. Proc. Vol. 87, B, p. 10. 


120 A Study of Trypanosome Strains 


accordingly be prepared for some such change as this in the shifting of the mean 
when the host is varied. 

We have next to inquire what type of curve accurately describes the strains 
which we are fairly certain are homogeneous. 


If the reader will turn back to p. 110 he will note at once a marked difference 
between the distributions for 7. caprae, 7. pecorum and T. simiae when compared 
with those entitled Mzimba strain, human strain, wild-game strain, 7. brucei, 
T. rhodesiense, T. gambiense and the wild G. morsitans strain. The coefficients of 
variation of the former group are all under 9°5 (mean = 9:00), the coefficients of 
variation of the latter group are all over 13°5 (mean = 17:29). We recognise 
therefore a totally different order of variability. Even in absolute variation as 
measured by the standard deviations we find the first group with its mean 
S. D.= 1°68 and the second with its mean 3:96. An examination of the graphs 
scattered through the trypanosome papers to which we have referred will, we think, 
convince the statistician that we have to deal with heterogeneous and not skew 
homogeneous material*. It becomes of course important to ascertain whether in 
the pure strains a Gaussian curve will suffice to describe the frequency closely 
enough for statistical purposes, for, if it does, the analysis into at any rate two 
Gaussian components of the heterogeneous strains becomes relatively direct, if 
laborious. I will consider the 7. pecorum, T. simiae, and T. caprae strains 
from this standpoint. 


(a) T. pecorum (see p. 110). 
Mean = 13:992 microns. S.D.=1:2816 microns. 


boa Observed | Calculated 
Microns Values | Values 
9 and under 2 0°46 
10 6 | 5°98 - 
11 42 45°41 x°=7°630 
12 193 192°52 P= 572 
138 452 456°70 
Ls 618 607°12 
15 453 | 452°49 
16 178 188-98 
LT 51 44°16 
18 and over 5 | 6°20 


Hence in 57 out of 100 trials from material following the Gaussian distribution 
a more divergent sample than that observed would actually be obtained. We can 
therefore conclude that a simple Gaussian frequency adequately describes the 
distribution in size of 7. pecorwm. This is illustrated in Diagram II. 

* Note especially the bimodal graphs in R. 8. Proc. Vol. 83, B, pp. 5 and 11, for both the Uganda 


and Zululand strains of 7. brucei, in Vol. 86, B, pp. 291—293, for human strains, in Vol. 86, B, 
pp. 395, 397 for wild-game strains and pp. 409, 411, 417 and 419 for G. morsitans strains, 


KARL PEARSON 121 


Total 2000. 


Frequencies per Micron. 


1 
' 
' 
' 
t 


OOo OAS S16 MSem9 
Microns. 
Driacram II. Gaussian fitted to T. pecorum Frequency. 
(b) T. simiae (see p. 110). 
Mean = 17'870 microns. S.D.=1°6558 microns. 


Microns | Observed Calculated 
Values Values 
14 and under ui 10°46 
| 15 28 27°63 
16 | 76 63°92 x?=8'149 
17 93 103°78 P= +520 
18 126 118°32 
19 92 94°66 
20 47 D878 
21 22, 20°96 
22 6 5°80 
23 and over 3 1:29 
150 
140 ; 
3 130 
2 120 
S$ 110 
Ss 
S 100+ 


Frequencies per Micron. 
QO 
fe) 


(OMmmsMNaee Se diemly is) 19-9901 21 Of a5) oO 


Microns. 


Dracram III. Gaussian fitted to T. simiae Frequency. 
Biometrika x 16 


122 A Study of Trypunosome Strains 


We conclude that the Gaussian adequately describes the distribution of 
T. simiae. In more than half the trials we should get a worse sample. See for 
graphical fit, Diagram ITI. 

(c) T. caprae (see p. 110). 


Mean = 25°508 microns. S.D.=2°1011 microns. 


Microns Observed Caleulated 
Values Values 
| 20 and under Aas 4:98) | 
21 8 9°82 
22 23 23°95 
23 49 46°74 x?=5°175 
24 79 73°05 ae 
25 95 91°38 val 
26 80 91°54 | 
27 | 68 73°45 
28 57 47°16 
29 24 | 24°26 
30 9 9°98 
| 31 and over 4 | 4°38 


This is a still more excellent fit; if the Gaussian represented the population, 
in 92 °/, of samples we should get a more divergent sample than that observed. 
The curve is given in Diagram IV. 


150 

140- 
2 120 te 
: 110, = 
4 10 
é 90, 
= 80 
= 70 
= 60 
eS 50 
S 40 
S 


al a 
va 
IN 
és os 
19° 90 QI 29)" 93) 94995" 96) 227) (2829 sOn sl asa 


Microns. 


Diacram IV. Gaussian fitted to 7. caprae Frequency. 


It will be clear from the above three illustrations of what we may term 
homogeneous trypanosome strains that the Gaussian curve of frequency suffices 
to describe adequately such material. It is equally clear that no Gaussian can 


Kart PEARSON 123 


possibly describe such skew distributions as we get in the wild-game strain or wild 
tsetse fly strain of the trypanosome species identified by Sir David Bruce and 
colleagues as 7. rhodestense*. It is equally impossible in the case of the human 
strains figured in the paper of February 1913+. I illustrate this on the frequency 
distribution for 6220 trypanosomes of human strains}. 


Observed Calculated Observed Calculated 
14 and under 1 | 75°45 26 425 | 520°55 
5 10 =| 62°51 I a7 | 372 444-42 
16 | 41 101'51 —s| 28 347 || 35796 
17 154 | 155°50 | 29 | 307 271°88 | 
18 325 | 224°73 I 30 | 198 194-81 | 
19 494 306-27 | 31 | 167 | 131°68 
20 528 393-73 | 32 123 83°91 
21 577 | 47739 83 77 50°44 
22 512 545°93 | 34 36 28°61 
33 525 588-91 | 35 | 12 15°30 
24 511 599°17 | 36 and over | 14 | 14:18 
25 464 | 575°04 | | 
| 


Here y?= 501 and P< ‘000,000,001. In other words description by a Gaussian 
is absolutely impossible. The histogram of observations and the curve are shewn 
on Diagram V. 

Now the suggestion that flowed at once from these results was the compound 
nature of all the material classed under the headings : 


G) TZ. rhodesvense. 

que) L brucer. 

Gu) TZ. gambiense. 

(iv) Mzimba Strain. 

(v) Wild G. morsitans Strain. 
(vi) Wild-Game Strain. 

(vii) Human Strain. 


With the experience of the Gaussian fitting the homogeneous strains, the direct 
step was to investigate whether the above material could be analysed into two 
Gaussian components and to determine how nearly these components were in 
agreement. The method of carrying out this analysis was provided in the first 
of my series of Contributions to the Mathematical Theory of Evolution§, There 
was nothing to prevent the process being applied to every individual frequency 
given by the trypanosome workers, except the very laborious arithmetic. The 
method was applied to the above seven cases, and also (viii) for the purposes of 
illustration to a single human case, that of Chituluka, a native of Nyasaland, who 

* See R. 8. Proc. Vol. 86, B, pp. 407 and 419. 
+ See R. S. Proc. Vol. 86, B, pp. 285 et seq. 
See R. S. Proc. Vol. 86, B, p. 300. 


- 
a 
§ Phil. Trans, Vol. 185, A, pp. 71—110, 1894. 

16—2 


A Study of Trypanosome Strains 


124 


‘soumosouvddiy, weuny_ Jo uowynqraysiq Aouonbely 4g 04 UvIssney jo oInTIVq “A NVUOVIG 
“sUuOoLoryy 


ge LE 9E GE VE SE GE LE OF 66 BG LG YS GB VS ES GS 1G OG BL gL LL SieSESvE Se Gl Ie OL 


i ine ane 
BN irae 
At ey 
N 
s 
\ 


fe) 


004 


OO 


OOV 


00S 


009 


OOZ2 


008 


‘uolay dad sarauanbaty 


0669 1PI0L 


KARL PEARSON 125 


died of sleeping sickness*. With the single exception of 7. brucei every one of 
these distributions broke up into two components, and into two components with 
strikingly close means. I propose to call these two components 7. minus and 
T. majus. I do not assert that they are distinct species; they may be dimorphic 
groups of one and the same trypanosome species. But the recognition of their 
existence seems to bring some order at least into the chaos we have already noted 
as existing in the trypanosome measurements. ‘Two human strains or two wild- 
game strains differ from each other with such wide divergence in their frequencies 
because these two groups 7. minus and T. majus are mixed in the individual 
in different proportions. 


| Standard Coefficients of Size of 
MSs Deviations | Variation Populations 
Strain = =o |e 
T. minus | T. majus | T. minus! T. majus | T. minus | T. majus | TL. minus T. majus 

T. rhodesiense ...| 18°7418| 26-1122 | 2°3184 | 3-4397 | 12-370 | 13-173 eae Bees 

7. brucei... «|| 19°8244 261122 2-6439 | 34134 | 13°337 | 13-072 Peace eos | 
“T. gambiense _...|19°8926) 26-2463 | 2-0566 | 26260 | 10-339 | 10-005 |) ORY. | | BNE 

Feamailecus apr SOD Re Pad iee N GokE ( 634-96 | | 365-04 

| Mzimba Strain ...| 19°8966 | 24:0508 | 1°3961 | 3:1028 7°017 | 12-901 ) 635°, |) 365 om 
| @. morsitans Strain | 19°6475 | 27-1966 | 1°7503 | 2-70138 | 8-908 | 9-932 eae 7 aa ’ 

Wild-Game Strain | 20-4418 | 25-8263 | 16332 | 2-8799 | 7-990 | 11-151 ae ae es 

OO"D eon ipa dO bet 
i 9e . F . 
Human Strain ...| 20-3687 | 26-2930 | 1-9444 | 3-4470 | 9°536 | 13-110. ercee “1 | woe 7 
Chituluka ... ...| 19°8410 | 28-7875 | 1-9785 | 2:8823 | 9-972 | 10-012 ae hoe y 
Means... _... | 19°8315 | 25-9542 | 1-8498 | 3-0328 | 9-360 | 11-712 | 9 —- ee 
T. simiae ... —... | 17°870 Be wp OOS ae aS 2700 = 100° - 
lo 
T.caprae... ..{ — |25:508 | — |21011| — | 8-580 = rae 
| | | 100 °/, 
| i 2 | 


The table below gives the chief biometric characters of 7’. minus and 7’. majus 
as found from the seven resolutions. The mean values of the constants for 
T. minus and for T. majus are placed at the foot; in calculating these mean values, 
Chituluka’s data have been excluded as already included in the human strain, and 
also those for 7. brucei not directly resolved. 

At the foot of the table I have placed the constants for 7’. simiae and T. caprae, 
the nearest pure strains to 7. minus and T. majus respectively. I do not in the 


hase Proc: Vol.86, By ip. 290. 


126 A Study of Trypanosome Strains 


least suggest there is any identity, but comparison may bring home to the 
trypanosome worker the average sizes of the two components*. The differences 
of the variabilities are, however, much larger, and the influence of host on 
variability as well as on mean ought to be studied. 

It will be seen at once that the divergence in the individual means of 7. minus 
from the general mean is very slight, at most a micron, and well within the limits 
which arise, as we have seen, from difference of host. It is a most remarkable fact 
that from six independent reductions the mean size of JZ’. minus should come 
out so nearly 19°8 microns. In 7. majus the correspondence is not so good; the 
average of about 26 microns falls to 24 in the Mzimba strain and rises to 28°8 in 
the case of Chitulukat. Still it does not appear to me that these changes of 
mean of the 7. majus component are absolutely beyond the variation due to differ- 
ences of host and treatment. Another more serious matter is the comparatively 
wide range found for the variabilities ; but even here it is impossible to assert that 
such differences will not occur with difference of host. For example the Mvera — 


cattle strain, a fair sample of the simple 7’. pecorwm, gives: 


| Fost M | Standard Coefficients of 
Ot tas | Deviation Variation 
Goat 13°80 1°462 10°592 
Rat 14°75 *839 5689 
Dog 13°79 1:087 7°885 


Here while the means are within one micron, the differences in variability are 
of the same order as those found in 7. majus from different hosts. 


Again, taking a pure homogeneous strain as 7. caprae with goat and sheep as 
host, which are scarcely so differentiated as man and antelope, we find: 


Hoc Mean Standard Coefficients of 
Deviation Variation 
Goat 25°31 2-187 8°642 
Sheep 25°60 1°92¢ 7512 
| 


Lastly, taking 7. simiae for goat and monkey we have: 


r Standard Coefficients of 
| Eos lean Deviation Variation 
| 
(ea ae eee — - aa | 
| Monkey ... | 17°26 1-403 8127 | 
| Goat | 18°11 1°687 9°315 | 


* The maximum average length of 7’. caprae is 26°8 in the waterbuck and of 7’. simiae 18:1. 
+ It should be noted that with the whole of the human data the mean is 26°33 and that Chituluka’s 


mean is very exceptional. 


Kart PEARSON 127 


I think we may conclude that, allowing for the errors of random sampling 
and the errors arising from the resolving process, the deviations observed in the 
variability of our two components do not invalidate the hypotheses : 


(i) That the widely divergent results obtained from different strains are due 
to the existence in the same individual of two types of trypanosome with very 
varying percentages from individual to individual. 


(ii) That one of these types has a mean length of about 19°8 microns and a 
variability of about 1°8 microns, the other a mean of about 26°0 microns and 
a variability of about 3:0 microns. The means may vary 1 or 2 microns with the 
nature of the host and the variability 0°5 to 1 micron. 


The large type predominates in the Nyasaland human strains*, on the average 
in about the ratio of 3 to 2, but the smaller type predominates in the G. morsitans 
and wild-game strains in about the same ratio; while in the trypanosomes classed 
as T. rhodesiense, and 7. gambiense as well as in the strain from the Mzimba 
donkey the preponderance is still cf the smaller type and the ratio approaches 
13 to 7. Whether these ratios are peculiar to the host or due to the infecting fly, 
it is not at present possible to determine. But the hypothesis of the existence of 
these two types,—whether as a dimorphism of 7. rhodesiense or as independent 
species seems to bring some order into the apparent chaos of recent trypanosome 
measurements. 


The following paragraphs give the calculated constants of the reductions, and 
the numbers of the diagrams showing the nature of the compound frequencies: 
Ga) T. rhodesiense. 
Mean = 23°577, 
fly = 21°86874, fs = 1079°10255, 
Ms = + 401986, Ms = + 1105°74834. 
Reducing nonic: 
249° — 298-7232q' — 5817q° + 1114°7684q° + 34°7620¢' 
— 117924954? + 12°9808q? + 0891g + 0001 = 0 
wheret p,=—10q. 


The root is p,=—12:2578. This leads to the two components in the Table 
p. 125. The histogram of the observations and the two component Gaussian 
curves with their compound are given in Diagram VI. 


The resolution is not a very good one; for 24 groups y?= 37°48, and P =:05, or 
once in 20 trials only we should get a worse result. But an examination of either 
the graph or the original frequency shows at once the cause of this divergence. 
In their measurements Drs Stephens and Fantham have had a strange bias in favour 


* The European from Portuguese East Africa had predominance of T. minus. See R. S. Proc. 
Vol. 86, B, p. 288. 


+ Notation of the memoir Phil. Trans. Vol. 185, A, p- 84, Eqn. (29). 


ins 


y of Trypanosome Sti 


e 


A Studi 


128 


‘snfvwm “J, pus snuru “J, oJUt asuarsapoys “JZ Jo Aouanbergq oy} JO UOTJNTOSey “TA WVUOVIG 
“sUuoLOUTT 


66 ee Ze 96 GE ve SE GE IE OS 66 8G 46 96 GG VG 86 GO IG OG Gi Bl Zi Ol SI VI Et aos tt Ob 


arawan Dau 


ad s 


‘wouoryy 


‘OOOT 1270.7, 


~ 
' Kart PEARSON 129 
| of even numbers. No curve whatever could fit the data satisfactorily under the 
7 circumstances! Either they used a scale graduated to 2 microns only, and had a 
7 prejudice in favour of the scale markings, or else their even numbers were in some 
way more conspicuous than their odd. Whatever the source of this peculiarity 

i” may be, there can be no doubt of the bias*. 

~| The only way to obtain a reasonable measure of the goodness of fit in Stephens 


| and Fantham’s results for 7. rhodesiense is to group from 10 to 12, 12 to 14 and so 

on in comparing the observed and calculated frequencies. If this be done we find 
x? = 5:03 for 13 groups and P=-957, a splendid fit. The frequencies are as 
follows : 


26-28 | 28-30 | 30-32 


10-14 | 14-16 | 16-18 | 18-20 


Observed | 9 | 38-5 | 93-0 | 133-5 
Calculated | 7-17 | 34°67 | 92-99 | 132-91 


20-22 | 22-24 | 24-26 }2-34,| 34-36 | 36-38 | Totals 


| 
134°0 | 127°0 | 1385°5 “1395 | 112°0 | 60° 
6 


5 15 1000 
124°79 | 124°28 | 146°35 | 145°56 | 106°55 | 56°22 | 21°36 | 5° 11 


"17 | 999°85 


{ 


(i) TZ. brucez. The data for this trypanosome were taken from Sir David 
Bruce and colleagues’ diagram+. I have not come across the original publication 
with the measurements involved in this diagram. Describing this species in 
: July 1910, the authors speak of its well-marked dimorphism. This is very 
obvious in the graphs for length given for the Uganda 1909 and Zululand 1894 
strains, but the numbers given are far too slender (160 and 200 respectively) to 
justify any attempt at analytical resolution. Graphically we may take it that 
roughly the following are the means of the components: 


T. minus. T. majus. 
Uganda 1909 20 microns 28 microns 
Zululand 1894 18 microns 29 microns. 


These are not very widely divergent from the values 


19°8 microns 26°0 microns 


we have found from the seven resolutions. 


In May 1911§ the two curves for Uganda and Zululand appear to be added 
together to give a 7. brucei curve of length distribution. This is again markedly 
bimodal with one component mean at 18°75 microns and the other at 27°5 microns, 
both approximative. Thus far 7. brucei appears quite well to fit in with our other 
material. But in September 1911 appears the diagram of 7. brucei said to be 


* Bias of this or of a similar character is not uncommon—even in the pages of this Journal. 
I remember once pointing out to a Scotch anthropometer his prejudice in favour of whole centi- 
metres. He looked at his results, recognised the bias, and then gravely told me that it was not 
due to any personal bias, but that the Creator must have designed Scotsmen on the metric scale! 

+ R. S. Proc. Vol. 84, B, p. 331. 

+ R. S. Proc. Vol. 83, B, p. 2. 

§ R. S. Proc. Vol. 84, B, p. 186. 

Biometrika x 17 


130 A Study of Trypanosome Strains 


based on 1000 individuals, Here there is a mode about 240, with possibly a sub- 
mode at 19 microns, but the evidence for dimorphism has largely disappeared. 
It is very desirable that we should know the details of this curve, ie. the nature 
of the hosts and so forth, for it apparently replaces the earlier data and remains 
the standard 7. brucei distribution. It certainly shows nothing of the definite 
heterogeneity (or dimorphism) of the previous Uganda material. 

Its constants are as follows : 

Mean 28°5290, 
fy = 19°30583, fy = 996°87764, 
Le; = 10°54837, Hs = 2146°37930. 
249° — 10186189’ — 4°0057q° + 140°6937q° + 62:0835¢q' 
— 29°39409? + 11:2371¢@ + 1.44139 + ‘0331 = 0. 

No suitable root of this equation exists and accordingly it would appear that 
this distribution is not rigidly reducible to Gaussian components. This result is 
so remarkable in view of the obviously bi-modal character of the earlier 7. brucei 
distribution, and the resolution into two components of all the other seven 
distributions, said to be allied to 7. brucez, that I determined to consider the 
matter further by fitting Gaussians to the ‘tails’ of the 7. brucei distribution*. 
I chose as the right-hand ‘tail’ the frequency from 28 to 38 inclusive, and as the 


left-hand ‘tail’ the frequency from 13 to 18 microns-inclusive. The two resulting 
components were : 


T. minus. T. majus. 

m, = 20°0817 (19°83), My = 26-4359 (25-95), 
o, = 2°8685 (1°85), oy = 36399 (3:03), 
he O2o0, Ny = 467°52. 


The totals populations for each component are clearly not very good and their 
combination exceeds by 9°6 °/, the total observed population; but the means are 
not widely divergent from the average values resulting from our six resolutions, as 
the numbers given in brackets testify. Accordingly I determined to select the 
means of the components at values near the mean values of six reductions, and 
after one or two slight betterments, determine the sizes of the populations and 
their standard deviations so as to give the mean, and second and third moments of 
the observed population. These provided: 


T. minus. T. majus. 
iy = 198244, ms — 261122; 
ao, = 2°6439, o, = 34154, 
n, = 410°83, Nei Oooala. 


* Biometrika, Vol. 11. p. 1 and Vol. vi. p. 65. 


; 
a 
4 


ea = 


Karu PEARSON 131 


The following table gives the observed and calculated values : 


Microns Observed | Calculated | Microns Observed | Calculated 
B) | 5 | 3°44 26 82 72°74 
Ls 8 | 5°80 | Bie 2) 67°98 
15 14 12°25 28 50 59°71 
16 17 22°79 29 38 48°10 
17 40 37°05 30 Dif 36°04 
18 63 52°87 ruil 26 24°79 
19 55 66°68 82 18 15°67 
20 66 75°44 | Go iil 9°09 
21 63 78°43 BY A 4°84 
22 75 77°49 85 4 PET 
23 87 75°61 36 — | 
2) 93 74:71 BH = -1:75 
25 80 74:36 ©6|| 38 2 J 


From these results we find y?= 29°92 and P=:22. Thus more often than 
once in five trials we should get a worse divergence than the observed, if the 
sample were taken from the calculated population. Some endeavour was made to 
better the fit by small variations from the above solution, discussed by least 
squares, but no improvement was effected. The two components are represented 
in Diagram VII (p. 132). 

(i) =T. gambiense. 

Mean = 22°1130, 
fy = 14°3389, jis = 531°3585, 
fs = 29°1104, Hs = 2429-0948. 
Reducing” nonic : 
24¢° — 7178109’ — 30°5070q' — 300:02609q? + 869°6372q! 
— 278°8475¢q° — 270°9547¢? + 58:9108¢q + 146050 = 0. 

This leads to p,=—10qg=—91777, and the components given in the Table 
p. 125. The two Gaussians and their compound are given in Diagram VIII - 
(p. 133). We find y?= 11:96, giving for n’ = 18, P=°80 a splendid fit. 

Gv) Mzimba Strain (from Donkey). 

Mean = 21°4130, 
fg = 87531, pis = 2935629; 
és = 26°6602, bs = 1926°7045. 
The reducing nonic : 
24¢° + 53°5186q" — 25°5876q° — 4157069? — 171:2637¢4! 
+ 227°12119° — 37:3371¢ — 30°8995q + 86177 = 0. 

The required root is p, = — 10g = — 40000, which leads to the two components 

given in the Table on p. 125. The two components and their compound curve are 
17—2 


ins 


ypanosome Sti 


a) 


of T) 


e 


U 


{ Stud 


ok: 


132 


. etn 
+ Sem — 
= 


—— 


a 


‘sn(pu +7, pue snwuwu “7 oUt 2aonIQ “J Jo Aouenbarg oy} Jo UoYNosey [TA WvuSvIG 


"sUuo.oUyy 


ge JE 98 GE VE EE ZE LE OF 6 8B LZ eg ai VG 86 2S 1G G6 Obese Li Ol Gwe 


~ 


| 
| 
| 
A 
| 


LT 
| i 


Ol 


O€ 


OV 


OS 


09 


Ol 


08 


06 


OO t 


“uowonyy ad sarouanbawq 


“OO0T 27707, 


3 


. 
e 


13 


Kart PEARSON 


‘sn(pum “J, pues snww 7 oyut asuaiquob “yz yo ouanbarq oyy Jo uoynposay [ITA Nvasviqg 
*sUwoloryy 


Ge ve ES oe 1€ OF 62 8G LO 96 GB VG SG BG IG oe 6 st Zt OL Gi VI Si al 


+ 
\ 
) 
a4 


Odl 
O€l 
OvL 
OSL 


“uounryy dad sarouanbaw sy 


“‘OOOT 2970.7 


Total 1000. 


Frequencies per Micron. 


134 A Study of Trypanosome Strains 
figured on Diagram IX on this page. We have y?= 19:28, giving for n’=17, 
P=-26 a fairly reasonable fit. 
(v) Wild G. morsitans Strain. 
Mean = 22°6952, 
fy = 184918, fy = 7584420, 
fe; = 43°0246, Ps = 89548788. 
210 
200 
190 
180 


Mean 


170 
160 
150 
140 
130 
120 
110 
100 


+ i 


10 fj 


\ 
es a 
Bi. = 


21-229 23.24 05 -96 27 28° 29 30 (Sil sORscmes. 


“Microns. 


14 5) Ge 8! sg 


Diacram IX. Resolution of the Frequency of the Mzimba Strain into 7’. minus and T. majus. 


~~~ 


KARL PEARSON 135 


Reducing nonic: 
24¢° — 224°6115q’ — 66°6402q° — 595-9589 + 5079°3305q! 
— 4500°7030q? — 1460°5459¢? + 879'6116g¢ + 1522340 = 0. 

The required root is p,=— 10g =—4°75085, which leads to the components 
given in Table on p. 125. These components with their compound curve are 
drawn in Diagram X (p. 136). Here y?=92'75 which for 20 groups gives 
P< :000,000,1. Thus although the G. morsitans strain breaks up into two com- 
ponents the combined curve is not a probable description of the frequency. One 
would like to test another sample of this strain, at present it tells against the 
validity of our reduction. 

(vi) Wild-Game Strain. 

Mean = 22°6220, 
fe ol; fog = 404°4932, 
ty = 29°0514, by = 2247-6657. 
Reducing nonic: 
24q° — 18°94469' — 30°38349q° — 250°2869q° + 851°7475¢' 
+ 118°6154q? — 212°3972¢? + 15°4222¢ + 144283 = 0. 

The root required is p, = — 10g = — 6°9859. There result the two components 
provided in the Table p. 125. The two components and their compound are 
figured on Diagram XI. (p. 137). We find y?=12°61 giving for n’=19, P=°81, 
an excellent fit. 

(vu) Human Strain. 

Mean = 23°7963, 
by = 170252, jt, = 7131660, 
fs = 27-1889, Ms = 80341222. 
Reducing nonic: 
24¢° — 1381°3796q' — 26°5147¢q' — 89°8059q@’ + 96441764! 
— 67427559q3 — 114°7894q" + 8144929 + 95887 = 0. 

The root is given by p,=—10qg=—8'5576, which leads to the components 
given in the Table on p. 125. The two curves and their compound are figured 
in Diagram XII (p. 138). Although the two components merely from the 
graphical point of view do not give a bad fit, the number of trypanosomes in- 


volved is so large that the deviations are not reconcileable with random sampling 
trom two such components. We find y? = 79°67, giving P < ‘000,001. 


‘snip *J, pue snuww “J OYUT UTeI4YG supzisLow “DH PTIM ey} Jo Aouanbarg oy Jo uorynjosey “yx NvYDVIG 


“sUuoLorTy 


eS LES OGr Gor SGN 2G V6e UGG S666 AGE OGsG I SiveZ LAO Gina St 


% O 
eS 
RM 
S OOL 
S 
3 ; 
= | 
S = 
O 
S 00G 
(= 1 
‘ i] 
= ' 
> 008 
nS | 
YS 
= ' 
B 'S 
iS 
=x | OO¥V 
\ 
| 
| 
| 
OOG 


136 


uowoy ad hauanbawy 


00G6 1PIOL 


Total 2500. 


Frequencies per Micron. 


KARL PEARSON 137 


In order to determine how far heterogeneity of treatment or material might be 
responsible we took further frequencies. In the first place we dealt with the 3600 
measurements for trypanosomes through the rat only. The frequencies are: 


| | _ ; 
| | | | 
15 |16|17)| 18 | 19 | 20 | 21 | DODD 2 | 25 26 27 | Q8 "29 | 80 1 8L 132.83 34 | 35 | 86 | 387 | 38 Totals! 
| Py 

| | ms : a i) =| aa 
iL 5 | 47 |} 112 | 161 | 216 | 290 | 316 | 376 | 362 | 322 | 294 | 235 | 219 zu al 108 | 88} 57 | 28] 9 | 8) 1 1 | 3600 | 
| | ee Ie | | 
400 


350 


300 


250 


200 


150 


100 


50 


1415 16 1718 19 20 21 29 93 24 25 96 27 28 29 30 31 32 33 34 35 


Microns. 


Dracram XI. Resolution of the Frequency of the Wild-Game Strain into T. minus and T. majus. 


Biometrika x 18 


‘ains 


y of Trypanosome Sti 


St ul d 


A 


138 


‘snlpu *J, pues snuww *f OUI UNG UBUIN_ 94} Jo Aouonbeag oy} Jo uOoIyN[OSay 


| at 
IN 
-- 
\ 


*suoLorTy 


ge LE 96 GE ve FE GE IS OF 66 8G LG a GG ue €6 66 16 OG 6 Bi 2I GI Gi FI Et 


— e— er er Orr rer hl 


‘IIX WvuovIG 


va 


0 


001 


006 


00€ 


0Ov 


00S 


009 


‘wou sad fawanbay 


0669 2Y707, 


Kart PEARSON 139 


These give : 
Mean: 24°6175, 


ple = 15°25897, Hy = 602°23008, 
fs = 19°21542, fs = 2023°21556, 
leading to the reducing nonic : 
249° — 80°8739q' — 13:2924q" — 42°3159q? + 306°5227¢' 
— 166°4257q? — 248654q? + 12°6008q + 1:2081 = 0 


which gives po = — 10q = — 70031. 
This provides the two components : 
T. minus. T. majus. 
m, = 21:6772, my, = 26°9993, 
o, = 2°2404, T, =a 298), 
nm, = 1611:18, Ny, = 198882. 


The components and their compound are figured in Diagram XIII, p. 140, 
and we find for n= 21, y?= 52°68 and P=-00016. There has thus been much 
improvement of goodness of fit, although the result is still unsatisfactory. 


It is impossible, however, to look through the graphs given by Sir David Bruce 
and others for the human strains* without being convinced of their fundamentally 
bimodal character, although there appears to be much evidence of its being 
disguised by heterogeneity of host and treatment. 


(viii) Diagram XIV (p. 141) gives the resolution for the human strain from 
Chitulukat. The constants 


Mean = 26:172, 
fin = 2302260, Hy = 1179°30786, 
fy = — 3718226, bs = — 3248°43805, 
leading to the reducing nonic: 
24q° — 393°8678q' — 49°6370¢q° + 520°2910q° + 8226:94354' 
— 12493°5620q' — 101-101 7¢@? + 855°7520q + 63°2383 = 0. 

The value of the root is p,=— 10g =—16:2295 and this leads to the com- 
ponents given in the Table p. 125, and illustrated in the diagram. The graph 
while giving broadly some of the features of the case is by no means a satisfactory 
fit; for n= 21 groups, y?= 86 and P is < 000,000,1. The diagram suggests that 
we are probably dealing with a mixture of three components with means about 


18°5, 25°5 and 31-0, but at present we have no satisfactory method of performing 
multiple resolutions of this character. 
* R. S. Proc. Vol. 86, B, pp. 285—302. 


+ R. S. Proc. Vol. 86, B, p. 291. 
18—2 


————— 


wns 


y of Trypanosome Stra 


Studi 


A A 


140 


1 


‘snfpu “7 pure snuw *7, oyut ATUO syey Ysno1yy ‘ureyg weunZ, wos souosoueddry, jo Aouonberq oy} Jo uoNNosery 


*SUudLIUTT 


‘IITX NVUOVIG 


) 


OOL 


006 


00S 


‘uowary wad fauanbaasy 


“0096 17207, 


‘sn(vu “J, PUB sNUUW “7 OFT VYNTNIYO oaAeN oy} Woy soutosouvddry, yo Aouanbary oy} JO UOTN[osay “ATX NVUOVIG 
"swouarTy 


66 8c LE 9E GE VE SE GE IE OF 66 82 LG 9G GS VS GG GG WW OG GI Bl Al QI Gl vl El Gl 


141 


KARL PEARSON 


os 


OOL 


OSL 


uowny ad houanbawg 


00ST 1279.7 


142 A Study of Trypanosome Strains 


It will be seen that the following strains, 7. rhodesiense, T. brucei, T. gambiense, 
the Mzimba, and wild game, give either reasonable or excellent results as combined 
frequencies of 7. minus and T. majus. On the other hand the G. morsitans and 
the human strains break up into reasonable pairs of componeuts, but the goodness 
of fit test is not fulfilled. In the case of the human strain, we better matters 
somewhat by taking the strain through the rat only, but the fit is still bad. If we 
confine our attention to a single human being, the case of Chituluka, we still do 
not get a satisfactory fit, although few statisticians could look at the four diagrams 
published by Sir David Bruce and others for Chituluka*, and not recognise the 
character of the material as being at least bimodal. The same applies to the 
Mkanyanga data of an earlier paper}, it is distinctly bimodal. But besides this 
bimodal character there are certain other features in the human data, and to a 
lesser extent in the G. morsitans, which appear to some extent to disguise the 
bimodal features. I am not prepared to assert definitely that this is the appearance 
of a third component. It is of course easy to improve the fit of the distribution 
by the introduction of such a third component, but the remarkable excellence 
of a bimodal resolution for 7. rhodesiense, T. gambiense, and the wild-game strain 
makes me hesitate at present to adopt such an expedient. 


Owing to the courtesy of Sir David Bruce (who heard from Sir John Rose 
Bradford that I was much puzzled over the differentiation of strains) I have been able 
to examine a series of drawings of the various strains of trypanosomes. There is 
no other morphological differentiation which impresses itself a priori on the layman 
and statistician, and which might serve as anew measure of the possibility of differen- 
tiation into 7. minus and T.majus. But it occurs to me that an index of breadth to 
length of the nucleus might just possibly serve as a differential character of even 
more importance than the length. It is only a suggestion and considerable caution 
would have to be used in selecting only nuclei not near the dividing stage. But 
it would be of striking interest to see how far the resulting frequency distributions 
for the nuclear indices were or were not bimodal. I think a classification according 
to nuclear index might possibly—to judge from the drawings—cut across the 
forms “intermediate ” in length. But this is only a suggestion which may appear 
idle to the student of the subject?. Some difficulty might also arise from the 
doubt as to whether the index was really greater than 100, or the nucleus as 
a whole had set itself athwart the “length” of the trypanosome. This difficulty 
would certainly have to be considered in the “stumpy” 7. brucei and T. gambiense 


* R. S. Proc. Vol. 86, B, pp. 291 to 293. 

+ BR. S. Proc. Vol. 85, B, p. 428. 

+ Several students of the subject with whom I discussed the matter stated that they considered the 
nucleus so mobile and so impermanent in form, that a ‘‘nuclear index’ would prove of little value. 
I think much objection could a priori be raised to the use of the trypanosome ‘‘length” on the same 
grounds. ‘The problem is rather, whether in dealing with large numbers we do reach an average type. 
It would only be possible a posteriori to justify the use of a nuclear index, i.e. if it were found to differ 
sensibly from one pure strain to a second, and if it confirmed in such cases as 7’. rhodesiense resolutions 
based on length frequencies. 


KARL PEARSON 143 


forms, but I am inclined to think that the index really passes through the value 
100. Undoubtedly this range of index, or possible athwartness of the nucleus is 
not conspicuous in the simple strains like 7. pecorum, T. simiae and T. caprae. 


Conclusions. (i) If appeal be made to statistical measurements, judgment 
between identity and diversity of strain must be formed by means of accepted 
statistical processes and not by mere comparison of graphs. 


(11) Statistical processes show that the conclusions already formed as to the 
identity of trypanosome strains from mere inspection of the graphs cannot be 
confirmed. 


(111) There must be some standardised process of treatment both in regard to 
host, and to method of and stage of infectivity at extraction. 


(iv) Even making allowance for differences due to host and treatment, we 
find remarkable divergences in the very strains asserted to be identical. 


(v) It would appear that some order would be brought into the chaos, if we 
could consider the strains described as 7. brucei, T. rhodesiense, T. gambiense, the 
wild-game, the Mzimba, and very probably the tsetse fly and the human strains 
as really consisting of two components, which for the time I have termed 
T. minus and T.majus. It is highly desirable that additional measurements should 
be made (? a nuclear index ascertained) to determine whether these lead also 
to similar components. 


I do not assume that this is a final solution of the problem, nor do I assert that 
T. minus and T. majus represent necessarily, although probably, distinct strains ; 
they may be dimorphic forms of one and the same strain occurring in different pro- 
portions. But, I believe, that the suggestion of their existence may help to explain 
some anomalies of the present chaos. I ought also to state quite frankly that this 
paper is not written in a merely critical spirit. I believe that the trypanosome 
workers have undertaken in their elaborate systems of measurements most laborious 
and most valuable work, but, I think, the time has now come when without 
trained statistical aid, but little further progress will be made in a very important 
and urgent matter. 


The very large amount of arithmetical work in this paper would never have 
‘got carried through had I not had the ever ready assistance of my colleague 
Miss Julia Bell; to Mr H. E. Soper also I owe help in the arithmetical work, but 
I have to thank him in particular for the careful preparation of the diagrams, and 
the planimetric determination of their frequencies by aid of which the x? for all 
but two of the compound curves was found. In the case of YZ. brucei and 
T. rhodesiense actual calculation of the areas of the normal curves was used. 


ON HOMOTYPOSIS AND ALLIED CHARACTERS 
IN EGGS OF THE COMMON TERN 


By WILLIAM ROWAN, K. M. PARKER, B.Sc., anp JULIA BELL, M.A. 


(1) Origin of the material and method of measurement. 


The settlement of Common Terns, which provided material for the present 
work, is one of old establishment on Blakeney Point, Norfolk. This is a shingle 
spit of some 8 miles in length on the north coast of Norfolk, about 12 miles 
west of Cromer. The colony is situated on the very end of the point, with 
water on three sides. Here the spit is a combination of dunes, salt marsh and 
shingle, and for the most part the nests are found on the open shingle on the 
seaward side of the dunes. Nests are plentiful in the embryo dunes in some 
years, though this year (1913) none were found there. The colony was more 
scattered than usual and covered the greater part of a mile of sea front. To 
avoid missing any clutches, Miss K. M. Parker, B.Sc., and Mr William Rowan 
divided the nesting area into suitable well marked plots and worked these one 
after another. Each of these again were worked in strips, till a patch was com- 
pleted, when the workers moved on to a remote one, to give the birds a chance of 
settling down again. After measurement each egg was numbered with indelible 
ink, so that any one egg was never measured twice. In all 203 clutches were 


handled. 
(2) Reduction of the material. 


The principal part of the work of tabling and reduction was carried out by 
Julia Bell*. The characters dealt with were: 


(i) Length of Egg : : ; : : L 
(ii) Breadth of Egg, maximum value. : : B 
(ii) Lateral Girth at section with maximum procaine : Gr 
(iv) Longitudinal Girth . 5 ; : : : : i Gy 
(v) Length-Breadth Index. ; : B/L 
(vi) Mottling, as determined from a Pas of one eggs. M 
(vii) Ground Colour, as determined from a tint scale. ; C 


* The authors have to thank Miss B. M. Cave for certain tables and their correlation coefficients. 
The Editor is responsible for the actual wording of this paper. 


W. Rowan, K. M. Parker ann J. BELL 145 


The Leneth of eze LZ may be considered as the easiest character to determine 
fo) {=} 
and needs no further comment. 


The Breadth of egg B should be closely related to the Lateral Girth G,, and 
in most cases the relationship G,=7B is very closely satisfied. If we sum and 
take the means we have 


a = Mean Lateral Girth/Mean Breadth. 
This gives in the present material : 
mw = 3224 as against 3142, 


which marks an error of about 2°6 wee rather larger than we might anticipate, and 
possibly due to the inclusion of a certain number of slightly damaged eggs, and 
the measurement of the eggs in the field and not in the laboratory. The relation 
between G, and B is a useful test of accuracy and should be determined with a 
slide rule before the egg is finally replaced in the nest, or lost sight of. 


The Longitudinal Girth G; is somewhat more difficult to measure, and a rough 
test of its accuracy not so easy to determine as in the case of G,. We have, how- 
ever, developed a formula for determining Gin terms of B and L, and on testing 
it we find that as a rule the differences are below 15mm. Such a formula may 
be useful as emphasising the need for remeasurements, when the observed and 
calculated girths have values much in excess of 15mm. We are not prepared 
to say, however, that the coefficients in this formula can be extended beyond 
the case of the Common Tern. 


While the Length-Breadth Index is valuable as giving a measure of the 
ellipticity of the egg, it is not of much influence on the apparent oval shape, 
unless we suppose some theoretical geometrical construction for the egg. Hf we 
suppose the blunt end of the egg to be approximately spherical, the hemisphere 
ending with the maximum breadth, then the egg might be considered as divided 
into two portions, the upper or hemispherical with radius $B and the lower with 
length from the base of the hemisphere (or ‘ equator’) to the lower pole = L—4$B. 
The ratio of these two segments of the length depends only on the index B/Z. 
Thus it is conceivable that this index has actually as much association with 
ovality as with ellipticity, although without some geometric theory of egg-shape, 
we are not able to make any dogmatic assertion as to the value of B/Z. It 
seems, however, a character of considerable interest as being free of absolute size 
and also some measure of shape. If J =B/L and O be the ratio of $B to L—4B, 
: B/L 
Le. O= = BIL eit 
have correlated O for eggs of the same clutch as well as J. Of course, since O 1s 
a function of J, there will be relatively little difference in the results. 


=I/(2—JI), we may consider O a measure of the ovality, and we 


Biometrika x 19 


146 On Homotyposis in Eggs of the Common Tern 


The mottling is a far more difficult matter for determination. The points 
which may be considered are: 


(1) Size and shape of individual splodges. 
(ii) Portion of the egg over which these splodges are distributed. 
(iii) Area of mottled surface as compared with whole area of the egg. 


The fieldworkers selected 9 typical mottlings (see Plate IX) and named these 
a, b,c, d,e, f, g,h, i; they then compared each recorded egg with these and selected 
the letter which marked the egg on the scale most resembling the egg to be 
recorded. There is little doubt that in this manner they divided the whole series 
of eggs into differentiated classes. But it may be doubted whether the judgment 
made depended on one only of the above three characteristics. Hence when we 
came to arrange the eggs a, b, c, d,...h, 7 on a scale of mottling, we found that the 
order would not be the same when we classified in turn by each of the three 
characteristics. We endeavoured to place the eggs in order by extent of mottling, 
ie. by (iii), but we think that the relatively low value of the homotyposis which 
has resulted is possibly due to size and shape of the mottlings, (i), having had 
as much influence on the classification as the extent of area mottling. Even 
position on the egg, (11), can influence judgment considerably. We believe that 
in future work on eggs, it would be desirable to classify the mottling of each 
by using the three characteristics independently. Even then an ocular appre- 
ciation, as this must be, may fail to give a very close measure of the nature of the 
mottling and thus weaken any homotypic correlation. 


The Ground Colour of these eggs varies through all shades of brown to 
brownish greens and blue-greens. The fieldworkers attempted to give the value or 
depth of ground-colour pigmentation without regard to the brown or green shade 
of colouring. The scale of values is given at the foot of Plate VIII. 


A point seemed worth consideration: assuming the pigments to be deposited 
on the egg in its passage through the oviduct, it was conceivable that greater 
pressure might indicate greater intensity of pigmentation. We accordingly 
selected the broader egg in each clutch and investigated for every pair of eggs 
from the same clutch whether the broader or narrower egg had the larger mass 
of mottling and greater density of ground colour. We reached the following 
results : 


The broader egg in every possible clutch-pair has 


Greater mottling in 26 cases | More dense ground colour in 25 cases 
a + « = . 

The same ‘ BY op The same > $ 39 
Less . 40s, | Less dense 5 5 BY a 


Perhaps not very much stress is to be laid on these results, but they suggest 
that the total amount of pigment deposited is less the broader the egg, i.e. for the 
same bird a relatively smaller egg will be more pigmented. A solution of this 


Biometrika, Vol. X, Part I. Plate VIII. 


Colour Value Scale 


Cambridge University Press 


SAMPLE EGGS,COMMON TERN, NATURAL SIZE 


Gs) 


enh 


i 


fe 


Biometrika, Vol. X, Part | Plate IX 


g h i 


TYPES OF MOTTLING OF EGGS OF COMMON TERN. 


W. Rowan, K. M. Parker anv J. BELL 147 


rather unexpected result may, perhaps, be found in the suggestion that the total 
amount of pigment is the same in both eggs, but the mottling and ground colour 
will appear denser on the smaller surface of the smaller egg. The point deserves 
consideration on the basis of larger numbers and possibly better defined measures 
of pigmentation. 


(3) Means and Variability. 


Table I gives the means, standard deviations and coefficients of variation of 
the several characters studied. It will be seen that the tern’s egg has for 
quantitative characters relatively small variation. The values of the coefficients 


TABLE I. 


Means and Variabilities (Absolute Measurements in Centimetres). 


Character | Mean SU ease 
| Deviation Variation 
| 
Length ZL ae .. | 4:14+:007 180 + -005 4°34 + "12 
Breadth 2B ‘ cog 2°98 + -004 099 + :010 3°33 4°09 
| Girth G... ae ae 11°39 + ‘015 376 + 010 3°30 + °09 
_ Girth G, mbt .. | 9°59+-014 347 + 010 3°62 4°10 
| Index B/L ~ Lad 72°04 4 136 3°449 + 096 [4°79+ °13] 
| Index of Ovality, O* .... | 56°35+°171 4:334+°121 [7°69 + °22] | 


of variation are less than many of those which we find for the human skull 
(3 to 8), but greater than those we know for the wing of the wasp. It is very 
doubtful whether the coefficients of variation of the indices should be included 
in such considerations, for the object of the use of these coefficients is to get 
rid of absolute lengths, and this is already done in the case of indicest. It is 
noteworthy that the length of the egg is only slightly more variable than the 
breadth and the breadth-girth is actually more variable than the length-girth. 


(4) Correlations. 


If we turn to the correlation of characters in the same egg, we note that 
while the ordinary product-moment correlation * has been calculated for all 
measurable pairs of characters, this is not possible for the ground colour or the 
mottling. Where mottling has been used with a quantitative character there 
has been calculated and both corrections used. Where mottling has been con- 
sidered in conjunction with ground colour, there we have adopted: mean square 


contingency correcting for both number of cells and for class-index correlations. 
* O=(B/L)/{2-(B/L)}. 


+ For example, if we take 1/0 for our index of ovality its mean =176-32, the standard deviation 


=11-24 and the coefficient of variation =6°38. Is O or 1/O the more variable? It does not seem that 
the coefficient of variation can help us in such a problem. 


19—2 


148 On Homotyposis in Eggs of the Common Tern 


Certain facts are at once obvious from this Table, others are obscured. In 
the first place length and breadth of the egg of the Common Tern have a rela- 
tively small relationship, while the relationship between the two girths is between 


TABLE II. 


Correlations of Characters in the same Egg. 


Characters | Symbols | Correlation Remarks 
— — = = = ——=— 
Length and Breadth L, B \+:2220+ 0374 = 
Longitudinal and Equatorial Girths | Gy, Gy +°5297 + 0284 = 
Length and Longitudinal Girth ...) Z , G, +°8804+-0088 = 
| Breadth and Longitudinal Girth ...| B, Gy) |+°5216+ :0286 = 
| Index and Longitudinz al Girth B/L, Gy | —'3832 + 0336 — 
Index and Length *: B/L, L |—"7284+°0185 = 
Index and Breadth es BIL, Bo | +5033 + 0294 = 
Mottling and Ground Colour M,C + ‘2260 (corrected C2) | More mottling, deeper ground colour 
Mottling and Index... | M, B/L |}—-1550 (corrected y) | Less mottling, higher index 
Mottling and Breadth | U, B — 1803 (corrected 7) | Less mottling, oreater breadth 
| Ground Colour and Index ... C, BIL ‘0000 (corrected n) | No relationship 
Ground Colour and Breadth | C, B  |—*1506 (corrected n) | Fainter ground colour, greater breadth 


two and three times as great. This probably flows from the consideration that 
the correlation of G, and G, arises from B being a factor in both and only 
secondarily from the correlation between Z and B. The correlation of the 
longitudinal girth with egg length is 60% higher than that of longitudinal 
girth with egg breadth; both these correlations are more substantial than that 
of the longitudinal girth, G,, with the egg index, B/Z. The egg index correlated 


with length is large and negative, and with breadth considerable and _ positive, 


precisely the results we should anticipate would appear if the correlation were 
largely spurious *. 


In order to ascertain how far it was possible to predict the longitudinal girth 
from length and breadth, double (for Z and B) and triple (for Z, B and B/L) 


regression formulae were ened out. The following equations resulted : 


(i) G.— G,=1-2701 (B-— B)+1:6415(L—L), 


or, G, = 12701 B +1°6415 L +8224, 
and = (ii) Gy — G; = — 17-2930 (B — B) + 146374 (L — L) +7636 (I —T), 
or, G, = — 17-2930 B + 146374 L +-7636 B/L — 527239. 


The first seventeen eggs were taken as a random set to test these results upon 
with the following values: 


* As a matter of fact the correlation of index and length for a constant breadth is —-997 and 
of index and breadth for a constant length is +-996 instead of unity. These values indicate how closely 
the linearity of regression holds in these quantitative measurements. 


W. Rowan, K. M. Parker and J. BELL 149 


TABLE III. 


Observed and Calculated Longitudinal Girths. 


Calculated Girth Ditference | 
Egg | Observed | 
Number | Girth | pmer l : 
Gy (ii) | (i) As | Ay 
ie - = | | =s 
i 11-40 11°14 | 11-20 + 26 +°20 | 
2 11°65 11°83 11°74 — 18 — ‘09 
3 12°10 12°24 12°07 | -— ‘14 } 4°03 | 
y 10-80 11°46 | 10°84 — “66 --04 
5 11°70 Deo ee aS + °47 +°39 | 
6 11-20 We elt — ‘07 —:14 | 
? 12°15 13°19 12:31 — 1°04 —16 | 
8 (i) 11-20 11719 1-27 + 01 — 07 | 
8 (ii) 11°30 11-09 WED =e. 2 +03 
9 (i) 11-50 1144 | 11°61 | + 06 Sil 
9 (it) 11-40 11°36 11°45 + -04 — 05 
10 11°50 11°52 11°61 — 02 —1l 
11 11°80 WES 5 iy ele 72 + 25 08 
12 11-90 11°62 11-74 + 28 +716 | 
13 -(@) 11°10 11°01 10°94 209: |. E16 
13 (ii) 10°80 10°75 10°78 + 05 | +02 
18 (iii) 11°70 1455 |) 11:55 + 25 e155 
e o3 a Root mean 354 146 | 
| square A | 


To judge by this small sample we obtain only increased inaccuracy by taking 
the more complicated formula. We shall only make an error of about 14 mm. if 
we calculate the longitudinal girth from 


G, = 12701 B +1:6415 E +8224, 


and for the egg of the Common Tern at least this is a convenient formula for 
verifying measurements in the field. 


The remaining correlations indicate sensible correlations, but these correlations 
might well be substantially higher had a better scale of mottling been adopted 
ab initio. In the first place we see that the mottling and the ground colour 
are sensibly correlated, and the deeper the ground colour the more intense is 
the mottling*. 


We have already seen (p. 146) that for eggs of the same clutch the broader 
has less intensity of ground colour and more meagre mottling. This is true 
for the eggs of the Common Tern in general, although it is probable that a better 
classification of mottling would bring out more marked correlations. The 


* This might probably be asserted interracially as well as intraracially, compare for example the 
swallow with the skylark, the lapwing with the ringed plover, etc. 


150 On Homotyposis in Eggs of the Common Tern 


following are the orders (a) of mottling chosen, (b) of breadth classes, (c) of 
index classes : 


| | 
| (a) () (c) 
| ser sy: . Order of Breadth Order of Index 
——— —_ {ics = —___— _ — 
Class B Class BIL Class 
gte+d a 3°00 a 72°64 
a c 2°99 f+i 72°54 
b gtetd 2°97 C 72°30 
Cc ttt 2°96 gtetd | 72:27 
h h 2°96 h 71°95 
ft+é b 2°95 b 70°54 
= Mean 2°98 Mean 72°30 


The relationship is small, but exists. It seems reasonable to suppose that 
the order of mottling classes as given by B or B/L, where there is only one 
displacement, may be a better one than that we have selected. But if in the 
mottling order b and c were interchanged, it would agree with the B classification, 
in so far that the three classes of least and of most mottling in the two classi- 
fications would be the same. 


We now turn to the ground colour. We see that the ground colour is 
fainter, when the egg has greater breadth, but that there is no relation of the 
index to the intensity of ground colour. The results of p. 147 are thus confirmed 
by the general correlation of ground colour and breadth. Although there is no 
high-correlation, we may assert that it is probable that the intensity of pigment 
dees not depend on the pressure during transit of the oviduct, but rather on 
a constant amount of pigment being distributed over a larger surface. 


(5) Homotyposis in Eggs of the same Clutch. 


The homotyposis, or degree of resemblance in character between eggs of the 
same clutch may be studied on the present material. The chief direct and cross 
homotypic correlations are given in Table IV. 


Pearson has shewn* that the degree of resemblance of undifferentiated ‘like 
organs’ might be expected to be equal to that of pairs of brethren, i.e. about *50, 
and proved that this is so for many homotypes in the vegetable kingdom, a result 
which has been since confirmed by much as yet unpublished material from the 
animal kingdom, including a number of series of birds’ eggs. Thus the mean 
value of the homotyposis for eggs of the Common Tern could hardly be improved 
upon. Only the colour characters show irregularity, especially the mottling, a 


* «On Homotyposis in the Vegetable Kingdom,” Phil. Trans. Vol. 197, A, pp. 285—379, 1900. 


ad hee 


W. Rowan, K. M. Parker ann J. BELL fod 


feature we have already indicated as difficult to measure. It will be seen that 
the correlation of the ground colour of an egg with the mottling of a second 
(3989) has come out greater than the organic correlation between mottling and 
ground colour in the same egg (2260). 


TABLE IV. 


Homotypic Correlations. 


| Symbols | Characters Correlation 
L, L£ | Lengths of Eggs in same clutch ... oe oes su ... | 4643 + 0346 
B, B | Breadths of Eggs in same clutch ... fee bet i ... | 5176+ 0326 
G, G, | Longitudinal Girths of Eggs in same clutch... et ... | 0076 + °0327 
G@,, @, | Equatorial Girths of Eggs in same clutch ae oss ... | 4621 + 0350 | 
: | Mean value ae te 4879 
Direct 
M, M | Mottling of Eggs in same clutch ... mo te “ee ae| °3500 
C, C | Ground colour of Eggs in same clutch ... es one oe “5709 
| Mean of six characters ... isi eis "4788 
| aan | 
L,B | Length of one Egg with Breadth of a second ... ... | 0922+ 0441 | 
C, M Ground colour of one Ege with Mottling of a second ... | 3989 + :0379 
Cross | £, G, | Length of one Egg with “Longitudinal Girth of a second... 4229 + '0362 
B, G | Breadth of one Ege with Longitudinal Girth of a second... | *2530+-0416 | 
1, Gy | Longitudinal Girth of one Egg with Equatorial Girth of a second | +2603 4 -0413 
| B/L, ie Indices of two Eggs of same clutch ie .. | 65374 0308 
Index | 0, Indices of ovality ‘of two Eggs of same clutch ... ee ... | 5527 + °0309 
| 1/0, Ho Inverse of indices of ov ality 5361+ :0317 | 
| | 
| | 
Mean of three Index Correlations —... 5475 
Mean of nine Homotypic Correlations 5017 | 


We feel that the classification by mottling is at present too uncertain, and 
that until the result cited has been confirmed with larger numbers and more 
definite categories, it would be idle to consider whether, while a given bird has 
usually highly or lowly pigmented eggs both as to ground colour and mottling, 
yet when in the individual egg there is an excess of mottling pigment, there 
may be some tendency to a relatively less increase of ground colour. Thus the 
correlation in the individual egg might possibly be less than the correlation 
between eggs of the same clutch. Such considerations must be postponed until 
the fact itself is adequately demonstrated. 


152 On Homotyposis in Eggs of the Common Tern 


Another relation suggested by Pearson* is that the cross homotypic corre- 
lation of the characters # and y should on the average equal } (correlation of 
w and w + correlation of y and y) x (the organic correlation of # and y). It is 
clearly impossible from what has just been said to apply this to the cross 
homotyposis of ground colour and mottling. We can apply it to the five cases 
in which quantitative measurements have been made. Table V gives the 
requisite data, the last two columns giving respectively the calculated and 
observed cross correlations. 


TABLE V. 


Cross Homotypic Correlations. 


Characters Direct Correlations ' Cross Correlation 
oe | Organic Correlation | Te 
| eee (1) and (2) | | 
| (1) (2) (1) and (1) | (2) and (2) | | Calculated | Observed 
| 

DL B | +4643 5176 22.20) +1090 0922 

L G, | “4643 OTK | “8804 | °4278 "42299 

ie G, | +5076 “4621 “5297 | +2568 2603 | 

B Gree ee ou7G Oi 5216 | "2674 "2530 | 
| eee 
| Gy 3) BEE “5076 SDDoM — 3832 | — +2083 — 2007 


When we compare the calculated and observed cross correlations, we see 
a striking agreement, or the theory that cross homotyposis is the product of 
direct homotyposis and the organic correlation of the characters under investi- 
gation holds very closely for the egg of the Common Tern. 

The general results obtained are in good accord with those reached by previous 
observers, and the authors hope to investigate one or two doubtful points on 
fuller material this year. 


* Phil. Trans. Vol. 197, A, p. 290. 


W. Rowan, K. M. Parker anv J. BELL 153 


APPENDIX OF CORRELATION TABLE 
TABLE A. Length and Breadth of Egg. 


Breadth. 
aS es 
[ | | | Totals 
Sikhs = {8 
39 89 Sa) Sn) 
| 1 
0) 
1 
| l 
Hee ee oat ae SS el Sy 5 
ee ee ah | ome le 5 
yas fake S33 sto Ie (eee rs 15 
SO eee ah hse eal | | ee | Pe 10 
on hes ee G6 Bal) A 25 
= ho Dule 6) hese) bal 5 26 
S05 09 ef —— |) — | 1 | 1 Dae ot lh gorleSa is) 2 27 
3 | 410—-p14 ee Sai) toe eon ebul|l 2 45 
415—h19 | Sea! OF Bi 2 8 24 
4:20—4-24 Win Siles | 6) 2 31 
425—429 Dae AN Ae AGB il 2 23 
430— 34 ee eG 3 ak) a 20 
4°35 439 Towser lute ero alo. loa 14 
44O—I 4h Oalenaileccceit al eae if 
44S 49 =| bl) tj |—)| 2] 5 
L50—b 54 Tj} 1{]—/ 2] — ] 6 | 
455—h59 | : | ; 0 
460—h65 1 
465—469 0 
LIO—-4VS 2 
Totals | ME Oe ileae os oa | 14 | 24 | 61 | 61 | 57 | 35 | 20| 10 | 2 | 294 
H | 
TABLE B. Garth L, and Girth B. Girth B. 
; i la2leaje| 
za . 9 
S| 5 
| ileal 
| Sales 
10°00—10:09 
10°10—10°19 
10:20—10°29 
10:30—10°39. | 
10-40—10°49 | 
10.50 10:59 4 —— | — |_-1 4 1 4h - | 
10°60—10°69 1 : = = | — 
< 10'70—10°'79 | — |.— | — ! 1 1 } 2 . 
10°80—10°89 | —- yl ya a 
e090 1090 == |= 1 yet 42) els) 1 | 
Se Om 1c 9 ean| ie et he Rs al | 97, We Se | ee 
© | i10o-1119] —| —| —|—| 2] 5] 9| 5] 2/—J—J1]— 
Mcgee Ve See ee) SINS lg 10}, 8} 5)/—]} 1}—)}— 
HATED Sa 30) | | ee ene) a em Fe el a |e ea 
eter .0— 7 1-7.) |) apace Webern @7 ==. ey 
| 11:50—11°59 Tees eee ae Ila acelin 
eel 60— 11-694) — ||. | —— iL a=} | AR i) 5 5 5 = x 
11°70—11°79 Sele Toshio eGullet — 
| 11:-80—11°89 > Teele ea Go Goe co) 
11°90--11:99 TOW POF ie Assen, oe | 
el? 00== 72-99 — = | — |) — = | | BE i) 
12:10— 12:19 | 1| 2 
12:20—12:29 et 
Totals 


Biometrika x WY 


Eggs of the Common Tern 


yposis in 


On Homot 


154 


“we 


a] ana 


HMO MHA A SS 


maf OD 19 19 190 | 


AAD O MOM | 


age ee pee es 

Sey aetna eh aera ee 

S|) Se ee ua 

y |7 

Cia lecue cee 

CUeOl | Galee aa 

SSM sleOs oleae ile | = 

See OE Neh enl Oe sl peel its ela 

Te ee ealeG tl aer eG eal) ilar = oe 
eich e\eGen ois) ee sed = 
= WEL Rowalntoe eee Nigel = 
[eh al at eee ier — = 

Ti lee ee 


STPIO, 


67-61—04-6T 
68-61 —06 6T 
66-61-—08.61 
61-6I—O1L-6T 
60-@1— 00-81 
66-TI—06-1T 
68-TI—08-1T 
6h-TE—OL-TT 
69-TI—09-1T 


65-1I—0¢-IT | 
67-II—OF-1T | 


68:-TI—08-1T 


66-TI—06-TT | 


61-TI—OT-TT 
60-TI—00.TT 
66-01 —06-0T 
68:01 —08:0T 
64-01 —O4L-0T 
69-01 —09-0T 


6¢-01—0¢-0L | 


64-01—0%.0L 
68-01 —08-0T 
62-01 —06-0L 
61-0I1—OL-0L 


Me bels®) 


60-0I—00-0T | 


"Tyne pup bby fo yybuayT 


‘OD FTavVit 


SiS Hie] eis SS ~~ | & | ~ S ~ Og Gg | te 

~ lL Ql} Al alT ale] se] ow] eo] ws S/R} RY Ss o D | a 

ae alle st SN > N S an S in S NY > N S | a 

IAS a eRe aL acl OG Cd en al a Wa | | | 

S|) see || Seale Se || elf Se alfeosarn te Set] Seal Sea ge Nosed | seta [ees <s ce oe 
‘yysuarTy 


Girth L. 


W. Rowan, K. M. Parker and J. BELL 155 


TABLE D. 


Breadth of Egg and Girth L. 


Breadth. 
»ni{[slaflsilaltssijafjsiloaltlxsiolwsloalrs 
Ma OU OES He WSO | FO. EO Oa 8S |S lo | | ee 
i) NX X » RW | & NW | & » % | % 9 | Ss) 
| | | | | | | | anlee| | eal | | Totals | 
ip S wy oS iW Ss wD S wD Ss Xe) S wD iS 
VOM Ore || Reale eco |G) | Ose Os |S S| a | ose |e 
RiRIl R{/RIRQIRI RI RI Rl nls {_n]_o] Ss 
10:00—10°09 1 
10°10-—-10:19 O08 
10°20—10°29 2 | 
10-30 —10°39 0 
10°40—10°49 2 3 
10°50—10:°59 | — | — | —| 1 1 - 2 
10°60—10°69 | — | — if eee ee eee | ee ee oe ee 1 
10°70 —10°79 | ee fag Fe = 4 
10°80—10°89 | | 5}. 1 a] |e 3 13 
10:90—=10:99 | —|—|)—| 1)}—|].1] 4| 4] 38] 5 18 
00 —thOO— |e | |=) 1 Ae ee Sale A eee ea Se | ees 16 
neon —— aie | 2 IO A | 4 | oe |) 1 24 
11:20—11'29 | — se Pe Te a eLO! Ch eed 2d 32 
AC eou wee fe eT lO Bee | Be | 4 = 3 
11-40—11-49 | eee ne oaelise| Cree. | Tk |p=—. lea 
11-50—11°59 | Dele eG oe Ob etal Wee ety cae 
11:60—11°69 | eye aliemee ie etal Pret can els aie oh |) 
11-°70—11°'79 | — - eae wo Bal) Vel 2) | 
11-80—11°89 | Aa 28) OC an a5 | al 
11:90—-11:99 Ses Paleese coer Oras | Seb ape 
12:00—12:09 | — | — | — | =i ee been. Sl at Weg i) = 
12:10—12:19 | — | — | | — | 2/—]| 1}/—] 2) 1J— 
12:20—12°29 | | | 1 
12°30 —12°39 =| | | = ae ee 
12:40—12°49 | | | | | — | - i 
pe tn 
i i} 
Heer ete oe a ae 24 | 61 | 61 57 | 35 | 20 | 10} 2 


TABLE E. Girth LE and Index 100 Breadth/Length. 
Girth ZL. 


156 On Homotyposis in Eggs of the Common Tern 


2 
| =) DOARNAMONHMAA tH 
$ AMI O10 N z 
i= 
67.6I—04-6I a 
i 
| 66-61 —08.6T S 
| 66-61 06-61 = 
| i 
61-61—O01-61T No) 
| 60-61 —00-é1 Ne) eS eS 
66-TI—06-TT oS 
| 
yeaa ellie ere 9 
68-1T—08-IT is 3S 
: a x Kt 
6L-TI—-OL-TT sy] oN 
nl ————| 
ee = = 
69-TI—09-T1 | = = 
> ° 
0, TT— Ne. oo = a 
69-LI—0¢-T] <e ma) 
ea ays x 
64-LI—0V-1I g = eal 
| Ce,.7T—ne aA OONHN aa = o> 
65-1 09:0E a S 
a | ee ee = ST 
6%-TI—0@-TI | Aamo a ac a 
4 oD tS ap b 
| S § |¢1-4—o1-4 s) 
6L-TIL—OL-TT a i Ss : = 
Pomerat ae 3 4 —C0.4 5 
: A 60-%—GO. 
| 60-1 00-1 7 cee = 
oy — Tl) SS tn tf —pn.t Re) 
66-01 —06-01 eet io) = LOT ROOT HY 
— 4 ~ = 
a = S 
1 O-—CR.E Ye) 
68-0I—08-01 29 = ee ms a 
1 N te.0—nG.e oO 
62-01 —0L-0T _ 16-806- oy 
, .E—G9.E Ne) 
69-01 —09.01 = = 68-8 98-8 i 
— & is 
.E—09-¢ 10 
| 69-01—09-01 aa a] 18.E—08-E 
: 
; arom 64-E-—-GAE oo) 
| 6-01-—04-01 Po = eee 
ae /.e ay pia = 
6801-08-01 ° ee Bice 
ee 69-2 —O9-¢ = 
66-01 —04-01 a 2 ‘ ? 
[Sue e =n perom a Py Sava (a 
, Pale 19-6€—09-§ °o 
OL-01—OL-0T =) : 
60-01 —00-0T A 
PA BABAARSRAW 
MMBIHRD HNWVIwD 
Mm he 4 
| ~ 
Seeeeseeseese {es 
3S X Sos Xe) S 
Q > & SSSSRRRKKLSVE 


* 
o 
ire) 
fo 
La) 


‘XOpuyTy 


W. Rowan, K. M. Parker anv J. BELL 157 


TABLE G. 


Breadth of Egg and Index 100 Breadth/Length. 
Breadth. 


fon | & + | d + | > = | oO s+ | st | Pe 

BS I Se Sh all Ps Te pe teeky tS ce eS) SS ee tt caainllE S | 

Rl Rl} _ eX |_ Raia Ie2le sl a&laelse lala] | 
| ee era Ne eh ee ay tb) oh ll a Botals} 

1D S LD 2 Ro > WwW > Ww > WD S w S 

eso ek | els pole TS |S Sis | 1S 

Ri RIL RIRI RIL RIRIRIRARIni HISD | SH /1H 


| 62:0—63'9 
64:0—65°9 
| 66-:0—67°9 
| 68:0—69'9 
70°0—71°9 
72-0—73'9 
ThO—75'9 
%6-0—77'9 
¥80—79'9 
80°0—81'9 
82:0—83°9 
84:0—85°9 


| | ~1m ts = conn 


Index. 
WoL PDE 
> OW DO NT bo Ow Or 


| aa) 
oe OTN 


Shiono kas | 


bo OOD) ATW bo 
on) 


i] 
Pe web 


bo bo OAT bo 


Totals 


TABLE H. 


Ground Colour and Mottling. 
Ground Colour. 


Totals 
15 16 | ; 73 
(+3-96) | (—5-28) | (+ °70) 
5 | 4 4 26 
(+107) | (+ :69) | (—1:45) 
So 14 3 49 
is (23°73) 
~ | 
18 ‘ 101 
= 
| 13 
| 29 
291 


158 On Homotyposis in Eggs of the Common Tern 


TABLE J. 
Index 100 Breadth/Length and Mottling. 


Index. 
S| SD [SD [Sa] Seonloaian | Lom onltoraicay 
SPU eet met comm ipo ian Umut WIL CAy mylene cco Ionian | be iassy [1 Wo) 
| © Ks) Ne) S ~ 2S ro NS rs ra) ore) ey | 
| | | anya | | | | | Patel Totals 
| San Ease ec eae a ge ell eral ieka eager |S | 
| Sy =“ xe} Soha | oe) RN => Ne} nH SS eR |= 
| Ne) Ne) Ne) 3° NN reel) ES BS ~ | w Se) ies) 
oc | 2 
a g+d+te 15 | 7 — 
D> a 6 l 
6 b Sule 
= c 92/4) 99 
h 5 
Fri 5 
Totals 


TABLE K, 
Breadth of Egg and Mottling. 
Breadth of Egg. 


| 


& : 3 a) 35 | 3 
esl | | | Totals 
2) i=) S 
S = RN 
on - 
| gtdte — | Qa Galva Feo UO ote GP oy ts : 
= a —|—;—]—j;—] 1) 1] 6] 4) 6] 3] 3] 2) — 26 
2 b ee eee Mee kerala sey ymoy lee |) ay aloe on 49 
aa ¢ 1. | 2) — | 8 3s Bl aa O28 19y) alize |e onli b: 1) Sie heaton 
h ee ee es ee ee hea 
fri | ie ae eA eal S| ef 29 
NN Oa a a a UO Po 
Totals 1 | 2{1|]2| 41/14/24] 59/611 56] 35| 20/10] 2 | on 
momAl li As Ht i é | | 


W. Rowan, K.-M. Parker 


TABLE L. 
Ground Colour and Index 


Ground Colour. 


AND J. BELL 


100 B/L. 


9 Seat canny | 


a eel edlaa fa i 


62:°0—62'9 


63:0—63'9 
64 O—C49 
65°0—65'9 
66°0—66'9 
67:0—67'9 
68-0—68'9 
69:0—69'9 
70:0—70°9 
71:0—71°9 
72:0—72:9 
73:0—738°S 
ThO—TH'9 
| 75:0—75°9 
760-769 
TiO Ti 
| %78:0—78-9 
79:0 —79°9 


Index, 


| Reo akacwwe pr | 
| Si paroo. aa en og eo<ar'| =i | ws | wre bp 


OD 


Hew bono e | les 


me NM DO or 


80°0—80'9 
S1'0—81°9 


| 82:0—82:9 


S3-0—83'9 
8h0—8 49 
85:0—-85'9 


Totals 


‘ig | 18 


TABLE M. 


Ground Colour and Breadth. 


Ground Colour. 


{ 
t 


Re fee 


| 
Oh 9'59 
260—2-64 
2-65—2°69 
270-27 4 | 
2-75—2°79 = 2 
S| 280-284 1 anes 4 
canis 2:85-9:39 f— | 3} 1) .3)| 81% 
Ss | 290-294 | 4| 8/13] 6) 12| 7 
oO 2:95—2:99 Sol e380) Bo 88 dill a5 
S00 S:0he Nee) | Le 10) Ob 10 
3:05 —3:09 Sf 6 Biaor| 8 
310—3:14 | — Nala he oe eaO 
PUB — PUG) | eh eT ih a 
ais 20-8121 =| 11 - 
| 


159 


TABLE N. 


Breadth of Egg in Pairs of same Clutch. 


On Homotyposis in Eggs of the Common Tern 


F0I—3 09 


3:10—8:14 
I — ITH) 


Totals 


| 
1 
pi 


a S oom = a 
» X R cS) & x ) x» | 3 85 S95 85 
Die Ue es ees ee Poet el Maley Walls ix] 
9» > a) AD YS Ww i) Ww S wD S wD 
ME Sigal SSF eet COT SCO SO OS A SNE) OPS a 
2 il RI Riel RIRliaAgl Rial ala |] a 
. | | ] 
| 255—259 | — | 1 
260—2°64 | 1 ; | 
265—2°7), = | cae Es 
275—2°79 ~ | 1 1 
| 280—2-84 = - | | 2 2) 4 2) L|— Wh || 
2-85—2°89 | 2 1 4 i) 1} —}--}| — 
290—2'94 1 4 4/10); 14 8 5 1 
2:95—2:°99 1 2 3) 16 | 12 Oy) || — 
| S:00—3 04 it 1 12 |) 12 8 8} 2 
- 8 4 6) |5—— 
8-i\ 135) 74 
2 ale 


= 
on) 
ase 
-I 
ou 
or 
or 
is) 


Totals 


TABLE O. 


Length of Egg in Pairs of same Clutch. 


L00—4 04 
4O05—4'09 


st D> Ty D> 
~ ~ XN Se 
at 
a lh | eal 
a[ mle] es 
SS eashe | Sse 


L55—43B9 


44O—Y 4S 


J 
= 
ol 


Sd ate Sd be 9) Dd 
XY fol lolals 
Sn) 35 35 An Sn) 
| | | | | 
wW SO WwW > WwW 
~~ | %© rs) Dd DS 
& | S Sn) Sn) 89 

Se ED | 

BT5—B TI | |-- 1 

3-80—3°8 4 | 


Totals 


20} 14] 1 


3-85—3°89 | — Ze ey Male ale at al — 
3:90-—3'94. 4 — | — 2) 2 3 a ea 
3953-99} 1 }—} 2/ 8] 6] 6] 2} 2}—] 2}—}—];—}—}]—]— 
4:00 —404 = 6 28 23 225) See es es 
4:05—4-09 | — =) Se a ae es le Sie ee 
410—414]—| 2 1 ee lee eta eS Ee ey MN cet Ea The hl 
| £15—419 | — | — 1;—|;—] 8] 4] 6] 2] 5)| 1 2};—;—|]—]— 
| 420—4241—|—! 1} 1] 2) 1) 1] 2] 5) 2) -2) 2) =|] 1 | —] 3 
| 4:25—429 : ;}—/ 4] 1] 3] 1} 2/—}]—jf 1]— = 
| 80-484 1— |—|—]| Ly—] 1] 1) 4) 2) a 4 be |e 
435—439 3 Deh — = | Se 4e |) 
Y4O--L hh Sl eee | 1 | 
| #45—4'49 =| pl 1 
450—h 54 | | 5 
1 oa 28 | 24 | 36 | 24 | 23 | 12 W 8 


Totals 


10:20—10:29 
10°30—10'39 
10°40—10'49 
10°50—10°59 
10°60—10°69 
10°70--10°79 
10°80—10'89 
10:90—10°99 
11:00—11:09 
11°10—11'19 


W. Rowan, K. M. Parker anv J. BELL 161 
TABLE P. 
Girth B in Pairs of same Clutch. 
| S(E(SiSisl/SyF/s{/Sl/sisys 
| ine) ine) ioe) Dd o> Dp D len) Ss S S S 
| s s 4 
| asec eter te dea al ie he i otels 
| S S S S S S S S S S S S 
R/S (S/S PRPSsl/Sl sel slaelSis 
~ ~ ~a Dd =) Dp ier) im) S S S S 
4 n 4 a 
8:20— 8°39 9 
8-40— 8°79 0 
&*80— 8:99 2 
9:00— 9:19 16 
9°20— 9°39 37 
9-40— 9°59 68 
9°60-— 9°79 57 
9°80— 9:99 31 
10°00—10°19 11 
10°20—10°39 3 
10°40—10°59 0 
10°60—10°79 3 
Totals 230 
TABLE Q. 
Girth L in Pairs of same Clutch. 
Sn oylliFas) Ss > a lala sf eS eS ss) es ee es 
S)/s/s is & = {| 2] 8 SiSi/E l/s lsl/ sls 
S =) S S S ™ mt ~ a il a ™ ™m | & N 
™ ™ iat = Oh] = =e ~ ™ = ™ ™ ™ | ™ 
ieee | sales | | Loiieede alae Pa | ok Pa) | totals! 
Ss S S S S S S S S S i S S =) S | 
S/S) s |S © =| 2/8 e/S}/eF;el/se]}s]s 
S S SS S ™ ™ ™ ban i iH ~~ a RN R 
SO 4 nN 1 mm | N sO | i 
1 


be | 


11°20—11°29 
11°30—11°39 
I1:-40—11-49 
11°50—11°59 
11:60—11°69 
11°70—11°79 
11'80—11°89 
11:90—11°99 
12:00—12:09 
12°10 —12'19 


Totals 


Oalla=. 
— | — 3 Il 2 
4 2) — 1 
3 2 + 3) 
1 | — 3 6 4 
2 1}|— 4 4 
— 2 2 2 6 
1 5 1) — 3 
1 
2); —}]— Fs |) al 
—/;|—|— il 1 
1 2 


rar [esrron Iker l. ‘a> no hoses: | 


| | | woanad | roe 


1 @) || al 1 


=at= | 2 
2 

Fle tte it aie, ee 

A esa ltota lt ested ao) a) ee ee 

ea OuINe a eal ik hel ae 

3| 3 

eon eee a Sol ae a 

Sp Gel eese |ulegte es eel 

3 ee In oi) wou 

yal ige |e | uae | a ees 

Se Woh Bou ee ee 

1 = 

(| een ee 

a1 | 20 | 18 | 11 | ose elle 


Biometrika x 


21 


162 On Homotyposis in Eggs of the Common Tern 


TABLE R. 
Mottling in Pairs of Eggs of same Clutch. 


Totals 


a 


OQ 


h 


fti Totals 


Pairs of same Clutch. 


TABLE &. 
Ground Colour of one kgg with Mottling of the other Egg for 


Ground Colour. 


Totals 
gtet+d 8 67 
(-5°16) 
al | 6 Wly/ 
| (+2°66) 
bh b 31 
= 
S Cc 10 82 
s (-1°71) 
h 1 14 
(1-00) | (— -94) 
f+t 4 3 13 
(+2°14) | (41:20) 
Totals 32 31 224 
iS 


In Tables R—T, the contingency of each cell is given in brackets, 


W. Rowan, K. M. Parker anp J BELL 163 


TABLE T. 


Ground Colour in Pairs of Eggs of same Clutch. 


a+b 


12 
42) | (+ °58) | (4+10°47)) (41:43) | (—2°97) 


f 3 4 4 12 20 8 51 
(— 5-96)] (—3-12) | (—3-12) |(+ 1°43), (48-28) | (+2°49) 

g—k 1 4 1 2 | 8 8 24 
(- 3-22) (42°49) | (45-41) 

Totals 222 


TABLE U. 
Length of one Egg with Breadth of the other Egg for Pairs of same Clutch. 


Length of one Egg of Pair. 


Breadth of Second Egg of Pair. 


a = = = ms 
2) S/S 2 | 
i Totals 
ese 
2:55—2'59 | | 1 | — c= = 1 
260—2°64 | — | — eee | 1 
265—2°7 = | = 0) 
25— 27 1 1 2 
2°80—2°84 = 1 1 a 1 2 2 1] — 1 12 
2°85—2:89 | — | — | — 1 3 oe 3 3 }|— 2 16 
2:90—2:94 1 4 1 7 3 6 i 2 is) 2 5 tn eee ee | 47 
2:95—2:99 | 1 | — 4 2 2 8 9 6 (ji) zt 83 5 Ab ee eel 55 
3:00—3 04 il 2 2 2 i 2H a3 5 4 5 4 6;—|]—]|1 52 
POSER) | | 4 (oe | 4 4 2 3 2)— 1 |= 1} — Zo) 
SHOE | | 1 2 3 3 1 1 Ae | |) SS ee 17 
3:15—3'19 == | 1} — — | — 2 
Totals 230 


164 


Length Girth of Second Egg. 


Length and 


TABLE V. 
Girth L. in Pairs of same Clutch. 


On Homotyposis in Eggs of the Common Tern 


10°20—10°39 
10°40—10°59 
10°60—10°79 


| 10°SO—10°99 


11°00—11°19 
11°20—11°39 
11°40—11°59 
11°60—11°79 
11°80—11°99 
1200—12°19 


Totals 


Length of First Egg 

& |os fe |aelalalyalals 

| | | | | | | | | | Totals 
Si S S => } S S S 

= S/Sia|/RXl/ S/S] s 

39 oi|(sei sisal arlasrls 


| to bo no bo co bo | 


pre 
EPDprhe OQD-! 


— i 
BORO Or 


TABLE W. 


Breadth and Girth L. in Pairs of same Clutch. 


Breadth of First Egg. 


2:55—2°59 


Length Girth of Second Egg. 


Totals 


10°20—10'39 


| 10°40—10°59 


10°60—10°79 
10°50—10°99 
11°00—11°19 
11°20—11°39 


| 11:40—11°59 


11°60—-11°79 
11°SO0O—11°99 
12°00—12°19 


2°65—2°69 


270-274 


280—2'84 


rele! | 


285—2°89 


47 


| Gees ioe | 


2:95—2:99 


Sora! | | | 200-304 


bo oR HAT 


on 
bo 


3°05—3'09 


bo 
or 


3-10—3'1h 


lromaale| | | 


Length Girth of Second Egg. 


W. Rowan, K. M. Parker anp J. BELL 


TABLE X. 


Girth L. and Girth B. in Pairs of same Clutch. 


Girth of First Egg. 


165 


10°20—10°39 


8:20— 8°89 


Breadth 
aD Sd 
NN lon} 
Sc | oo 

| | 
S > 
& | & 


9:20— 9°39 
9-40— 9°59 


9:00— 9:19 


10:00 —10°19 | 


10°40—10°59 


Index 100 Breadth/Length in Pairs of same Clutch. 


op Meros50 lst ms a ad Se |p | 
10°60—10°75 etl Se ee | | 
OOO OO A | 2) G18 eo a) Se | 
11-00—11°19 Ae ol ae pie eb |= melt 
11°20—11°39 }—| 83| 9/11 ]}12]} 8} 4)/—]—j 1 
ie om 0 oOl a a 9 3a) SAN tA Ni | 6) B)|| 2el\e=9) 
11°60—11°79 Sel eouee ied | sole ot tle = 
11°80—11°99 ees alpaca ee 50m al wt j=l) Sete 
12-00—12°19 | } 2] 1] | 
Totals oF Oo iO | 8 | 16 | 37 | 68 | 59 | 31 | eS lewia| 13 
TABLE Y. 


65 


69 70 


| TL NG 72 


failed 


76 


= — 2 1 
15 3D) | ol 1 
3 say |p aay |i) baa) 
15 | — 3 2°5 
15 | 3 tf 75 
15) 2eon 1 :on Le 
— 5 | 5 5 
3 6 
1 2 15] 6°5 
5 °B) || —— 3 
| 1 


bo LY BO Ge OH 
OV CH 


5 

O55 | 
— Lat 
== 4/4] 
= |? 
3 I 1: 
6 |6: 
25/1 
10 
il 
2 

5 | 2 


Se OW HO 


5 | 
‘i | 
3 I — {5 | — 
B59 ea ebe 
2 | 5/—|]—|— 
3 2°5| — | — 
i 62: — | — | 
ee 
25 
3) 


Totals | 


HPHOmlprEwoH 
HOR HR HH 


at 


| 
| 


166 On Homotyposis in Eggs of the Common Tern 


MONK FOr DMDOAADIEOD 
sl AAMAS 


wd A 
ie ics re) 


58 | 59 | 60 | 61 


in Pairs of same Clutch. 


TABLE Z. 


100 B/L 
2 — B/L 


6321 85h C66 56 | 57 


Index 


TABLE AA. 


Clutch. 


f same 


a 


2 L/B—1 in Pairs 


W. Rowan, K. M. Parker anv J. BELL 


60-6—00-6 


66-I1—£6-T 


96-I—¥6-1 


86-I—16-1 


[egal Sse Ge 


06-I—88-T 


lie Wlealer? eee ete 


48-I—G8-T 


| Ja aa | faa] | | 


| [aaa ece | 


ee a 


mame HTN WOODY SS 
| 


nN |p ora 69 a1 | 


[Egat tina SenlhON | 


69-I—49-T 


[| <2 JAG Sts A eSeSiaECY 


99-I—"9.1 


Anan Pao walion 


&9-L—19.T 


sow | a! | 


09-I—8G-T 


AG EGET 


4G.I—@G-T 


167 


168 


Index of Second Egg. 


| 


TABLE BB. 


On Homotyposis in Eggs of the Common Tern 


Girth L and Index 100 B/L in Pairs of same Clutch. 


Le 


ng 


th—Guirth of First Egg. 


3 | 26 


(39 | 48 


51 | 


> |X |< Di oma) fies ol | al |iey 
S571 BO Sa S> oH SORE |e S= IEC Salas 
> > > SSF | sStlosh PS St | Ss 
ee ee ere IL | 
S = SIS S Ss S > S S 
X Sy Se) se Sis pat | SO at) ACO SS 
=) =) S =) ™ ™ = ™ ™ RX 
~ ial ~ ~ m el al ~ ~ 
SS a a ee ee 
| 62:0—62:9 | 14s ee 
63:0—638'9 — 
64:0—64'9 | 2 1} — |] — 
69:0=—65°9 - 1 
66°0—66°9} 1 1 =: 1)>—|;— 2/— 2 
| 67:0—67'°9 3 1 2 he 
68:0—68'9 Healy oll 3 9 el) = 
69:0—69°9 Lie Q aoe 
_ - . e | ¢ 
70°0--70'9] —|—}|1 | 1 | Ge) 2B a 22 ae 
CL 07129: 3) LON WS 8 2 Sued 
T20—72:9 6 3] 6 7 6 | 7 | — 
73:0—73'9 Sle Bol al) e6a) ¢3a|ee3a es 
TLO—7TL9t— |—]} 1 5 4 ul 6 4 Py i 
75:0—75:9] — |— | l 2 4 2 3 Ba lp 
76-0—76:9}| — | l = 4 2 2 1 1S 
77:0—77'9 > 1 Vets 3 
78°0—78°9 1 
79:0—79°9 —}|—|—| 1 
| | | 
Totals 


MISCELLANEA. 


I. The Statistical Study of Dietaries, a reply to 
Professor Karl Pearson. 


By Proressor D. NOEL PATON, F.R.S. 


PROFESSOR PrARSON’S criticism of Miss Lindsay’s Study of the Diets of the Labouring 
Classes in the City of Glasgow (Biometrika, Vol. 1x. Oct. 1913) is a good example of the 
danger of one who does not understand the problems involved and who is ignorant of the 
work already done upon a subject attempting to discredit the results of an investigation by 
the application of mathematics according to his own fancy and in, what seems to me, a 
totally illegitimate manner. 


Not appreciating the questions which were under investigation, he starts his criticism by 
demanding that our studies should afford a solution of problems other than those we had 
before us, and, because he does not find the solution of these problems, he proceeds to abuse 
the work. 


Apparently in his opinion the object of the studies should have been to determine what 
effect the diets which the families were taking at the time of the study had upon the 
physique of the various individuals. He states that, if adequate anthropometric observations 
had been secured in such a study, it would have been at once possible to co-relate these 
with the diets. It is unnecessary to point out, as was pointed out in the Report, that the 
physique is determined by the whole previous condition of life and by the influence of 
heredity, and that it is absurd to attempt to relate it solely to the diet (Report, pp. 3 and 4). 


The objects of the studies are quite clearly stated on p. 4 of the Report: “Do the 
working classes of this city get such a diet as will enable them to develop into strong, 
healthy, energetic men, and, as men, will enable them to do a strenuous day’s work ; or are 
the conditions of the labouring classes such that a suitable diet is not obtainable? Further, 
if a suitable diet is obtainable, and is obtained, is it procured, or can it be procured, at a 
cost low enough to leave a margin sufficient to cover the other necessary expenses of the 
family life, with something over for those pleasures and amenities without which the very 
continuance of life is of doubtful value?” 


It was accepted as proved by previous work that for the labouring classes: “If a family 
diet...... gives a yield of energy of less than 3500 Calories per man per day it is insufficient 
for active work, and if less than 3000 it is quite inadequate for the proper maintenance of 
growth and normal activity.” 


The first question investigated was: “Did the families examined receive this supply of 
energy?” As regards the poorest classes this was answered in the negative. The validity of 
this conclusion has not been challenged by Professor Pearson, 


Biometrika x 22 


170 Miscellanea 


The second question considered was whether the diets contained a sufficient supply of 
protein. Previous work indicates that this is probably something above 110 grms. per man 
per diem. It was shown that in families with regular incomes of over 20s. a week the 
average protein intake was above 110 grms., and that in families with regular incomes and 
in those with irregular incomes of under 20s. a week the average protein intake was under 
110 grms. This conclusion has not been refuted. 


Accepting our premises, the final conclusion was (p. 27) “that while the labouring classes 
with a regular income of over 20s. a week generally manage to secure a diet approaching the 
proper standard for active life, those with a smaller income and those with an irregular 
income entirely fail to get a supply of food sufficient for the proper development and growth 
of the body and for the maintenance of the capacity for active work.” 


The main points proposed for the study were thus elucidated. : 


The part of the Report to which Professor Pearson specially directs his criticism is not 
the main problem, but that dealt with on pp. 30 and 31—The Physique of Children in 
Relationship to Diet, a subject taken up at the suggestion of Dr Chalmers. Professor Pearson, 
having declared the data totally insufficient, proceeds to apply his statistical methods not to 
refute Miss Lindsay’s conclusion, but to demolish other conclusions upon the relationship of 
physique to income which were never deduced by us. 


The very guarded conclusion in the Report was: “These show very markedly the relation- 
ship between the physique and the food. When the weight is much below the average for that 
age almost without exception the diet is inadequate.” 


Weights alone were considered. Thirty-six children, boys and girls, were dealt with. As 
the relationship of weight to income was not under consideration, they were classified not 
according to the income but according to the energy value of the family diet. Hence 
Professor Pearson’s remarks upon this point are quite beside the mark. 


I give below, in a re-arranged form, the Table from Appendix IV. The individuals are 
placed in two groups according to the energy value of their diets, with, opposite each child, 
the average weight for the age, taken from the Report of the Anthropometric Committee 
published in the Transactions of the British Association for the Advancement of Science, 
1883, and with the difference between the weight of the child and the average weight. The 
differences between groups 1 and 2 are sufficiently marked and warrant the conclusion as 
stated above. 


That is, of the children in families the diets of which yielded more than 3000 Calories per 
man per day: 
10 were above the standard or not more than 5 lbs. below it, 
8 were more than 5 lbs. below it, 


while of the children in families in which the diet yielded less than 3000 Calories 


3 were above the standard or not more than 5 lbs. below it, 
15 were more than 5 lbs. below it. 


It must be remembered that the ‘standard’ is for the children of all classes and not for 
those of the poorer classes. 


The fact that the average age of the children in the second group was about 1? years 
greater than that of the children in the first group does not account for the marked 
difference. 


The last question which Miss Lindsay had to consider was, how the necessary supply of 
energy and of protein might be supplied without increased expenditure, and she was right in 
stating that these can be more cheaply purchased in vegetable than in animal foods. She 


Miscellanea 171 


TABLE A. 


Family Diets above 3000 Calories per Man per Day. 


_. | Age in Weight | enenray| 
Number | Calories ve ; Sex nee OA Weight | Difference 
ears in lbs. . 
in lbs. 

2 4003 a f°) 39 46°7 - 77 
2 4003 10 3 63 67°5 — 45 
2 4003 8 3 50 54:9 — 49 
2 4003 5 3 35 39°9 -— 49 
36 4091 3°25 3 35 5450 0) 
4 3882 8 2) 45 52°2 | — 72 
32 3822 6°25 2 39 42°4 — 34 
4 3882 6 ie) 39 42°4 - 34 
4 3882 10 3} 56 67°5 -11°5 
39 3422 10°5 2 55 65 —10°0 
50 3471 6°25 io) 37 42-4 | — 54 
50 3215 6 9) 47 42-4 + 46 
BS 3116 6 2 43 42-4 + 06 
18 3248 55 @ 43 41-0 ap 2450) 
54 | 3282 5 Q 33 39°6 — 66 
58 | 3080 6 3 38 44-4 | — 6-4 
30* | 3136 55 4 21 4] | ~—20-0 
49 | — 3841 55 3 42 41 + 1 


* Family with rickets. 


TABLE B. 


Family Diets below 3000 Calories per Man per Day. 


. A | 3 Standard 
Number Calories ' Age . Sex Weight Weight | Ditference 
in Diet in years in Ibs. nba: 

d4 2690 13 ie) 76 87 —11°0 
14 2690 12 Q 60 76°4 —16°4 
14 2690 10 ie) 45°5 62:0 —17°5 
US 2936 10 o 56 62:0 — 6:0 
7 2931 9°75 io) 44 62°0 —18°0 
D0 2686 5°75 2 42 42°4 — o4 
14 2690 9 3 45 60°4 — 15-4 
4l 2723 6°75 3 53 49°77 + 3:3 
14 2690 6 3 36 44-4 — 84 
57 2974 5 3 37 39°9 — 29 

3 2891 5 rey 37 39°9 — 29 

2 2772 5eD 2 34 41°0 — 7:0 
24 2412 eT: 2 39 68°1 —29°1 
21 2329 9 g 37°5 55°5 —18°0 
24 2412 6 g 28 42°4 —14°4 
21 2329 1 3 60 72°0 —12°0 
10 2435 8 3 43 54°9 —11°9 
59 1978 5 4 26 39°9 —13°9 


172 Miscellanea 


undoubtedly starts with the well-known conclusion that a Calorie in the food absorbed in a 
mixed diet from whatever source, protein, fat or carbohydrate is of equal dynamic value. 
Previous work amply justifies this. 

She was not foolish enough to attempt to draw any conclusion from her investigations as 
to the relative value of animal and vegetable food in the diets on the physical development 
of the individuals. 


Professor Pearson seems entirely unable to grasp the fundamental fact that the physical 
development of the individual depends largely upon his past conditions of life. To co-relate 
it with the special constituents of the food which he habitually eats will require not only an 
enormous series of studies, but a full investigation of the character of the various food stuffs 
and of the mode of cooking. 


These points I tried to explain to him when I wrote to him in summer. He did not 
write to me as, in his criticism, he says he did. Miss Lindsay forwarded to me a letter 
from him to her, and I wrote a reply to Professor Pearson which he did not acknowledge. 

In conclusion I would say that before he expects his criticism of a physiological problem 
to be taken seriously, he had better make some attempt to understand the nature of the 
problem. Certainly it is not my intention to waste time in replying further to his criticism 
unless in the future it is more pertinent than is his present contribution. 


II. The Statistical Study of Dietaries. A Rejoinder. 


By KARL PEARSON, F.R.S. 


I puBLISH Professor Noel Paton’s reply because it is very typical of the type of difficulty 
which we meet with at present, when we assert that what is really statistical work must be 
undertaken only by the adequately trained statistician and that when it is not, then the 
investigation cannot be considered as falling into the field of science. 


Professor Paton states that the following question given on p. 4 of the Report formulated its 
object: “Do the working classes of this city get such a diet as will enable them to develop into 
strong, healthy, energetic men, and as men, will enable them to do a strenuous day’s work; or 
are the conditions of the labouring classes such that a suitable diet is not obtainable?”... 


Now Professor Paton either assumes that the sample taken of the diet of the individual 
family was their customary diet, or he does not. If he does, then the question: Was the diet 
such ‘as would enable the working classes “to develop into strong, healthy, energetic men”? 
has meaning. If he does not, not only is it idle, but the section dealing with the physique of 
the children on the basis of a sample diet taken as a rule for a week (occasionally for a fortnight), 
is beside the point. 


But anyhow, I ask how he can possibly ascertain how the working classes will “develop into 
strong, healthy, energetic men,” if he does not take an adequate anthropometric survey of the 
families subjected to the dietaries recorded? He says that it is accepted and proved that “ If 
a family diet...gives a yield of energy of less than 3500 calories per man per day it is insufficient 
for active work ; and if less than 3000, it is quite inadequate for the proper maintenance of 
growth and normal activity.” He further assumes with Miss Lindsay that calories from animal 
and vegetable foods have equal “dynamic value.” I assert that neither of these conclusions, 


Miscellanea 173 


which he accepts, are based on adequate research and they are in fact refuted by Miss Lindsay’s 
own material. For, if it can be shown that animal and vegetable calories have different results 
on the physical development of the children, it is clear that the first statement as to how many 
calories are needful for the proper maintenance of growth has no significance until a statement 
is made with regard to the source of the calories. Professor Paton cites no evidence for his 
statements; from what I have read on the subject of calories, I feel convinced that most 
of the data on the matter would not stand for five minutes any adequate statistical analysis. 
The Report, Professor Paton tells us, shows “very markedly the relationship between the 
physique and the food.” Yet in a previous paragraph he says ‘that the physique is determined 
by the whole previous condition of life and by the influence of heredity, and that it is absurd to 
attempt to relate it solely to the diet.” 


Now the only way to ascertain whether there was a marked relationship between the food 
and the physique of the children was to correlate the two for a constant age and investigate 
whether the correlations were such, having regard to their probable errors, that they could be 
considered significant. I did this with the result that the total calories in the food and the 
girls’ weight for constant age was not definitely significant with regard to the probable error, 
while in the case of the boys the probable error was so large that it was impossible to say 
whether the relationship was really considerable or not. In fact no marked relationship could 
be deduced from Miss Lindsay’s data, they were too inadequate. If Professor Paton’s statement 
as to the influence of heredity is to be trusted, then even my correction for age was inadequate, 
and the data ought to be corrected also for physique of parent! If so, why was the parent not 
measured ? 

Professor Paton places before the readers of Biometrika two tables on which this “marked” 
relationship is asserted by him to rest. One of the cases in his Table A, No. 32, is erroneously 
placed in this table; the details show that the number of calories was 2949 and not 3822* ; 
it should be in Table B. These tables contain 16 boys’ weights and 20 girls’ weights. Professor 
Paton takes the British Association measurements, which are, of course, wholly inadequate as a 
test of Glasgow children, and making no real correction for age+ considers whether the children 
in the two tables were or were not above the quite arbitrary limit of 5 lbs. below standard. He 
gives us no measure at all of the significance of the result, which is based on the vagaries of 
sampling 16 boys of ages from 3 to 11, and 20 girls from 5 to 13; and he supposes in some way 
that this treatment can possibly refute the correlation coefficient, wo,» Of weight and food 
calories for constant age with its probable error! I can, however, throw more light on the 
matter. Owing to the great courtesy of Dr Chalmers, Medical Officer of Health for Glasgow, 
I have been able to more than treble the number of weights of the boys and girls subjected 
to the dietaries. The results for total calories in food, C;, now aret: 

Girls, 69 Boys, 55 
alec, = +21 £08, al uc, = +05 +09. 
Thus the relation for boys is now quite insignificant, and for girls may well be insignificant 
also. At any rate although both correlations are positive, there is no “marked” relationship 
between the physique and the dietary. Of course, it may be said that these weights (w) have been 
taken at some interval after the dietaries were recorded, but unless we assume the dietary to be 
a rough measure of the permanent feeding of the family, whose physique has been gradually 
built up for years before the dietaries were recorded, the observations must be discarded as of no 
value at all for testing physique, or as Professor Paton phrases it “development.” 


* In the Appendix V of Rickety Families, it is given again ; this time as 2329 calories, 

+ The deviation at each age would have to be measured in terms of the standard-deviation of weight 
at that age; naturally the deviations are larger for older children. 

+ I have to thank Miss B, M. Cave for the present series of correlations, 


Miscellanea 


= 
aT 
i 


But the most interesting point ascertained from the new material is the confirmation of the 
result that the higher the proportion of animal to vegetable calories the greater the weight. In 
Biometrika, Vol. 1x, p. 533, we had for 16 boys and 20 girls: 


Boys: alws Cy/C4= — 23416, 

Girls : alws Cy/C_= — 12°15. 
We now have for 55 boys and 69 girls: 

Boys: alws Cp/C4= ~ 380+ 08, 

Girls: ailing, Cy [C4 =— — 94. + ‘08. 


These results seem to indicate that Miss Lindsay and Professor Paton, who supports her view, 
are in error when they consider a calory the same whether it be from animal or vegetable food. 
On the other hand, our larger numbers now indicate that : 


(i) For a constant age the expenditure on vegetable or on animal food has no sensible relation 
to weight. 


(ii) For a constant age the number of calories in vegetable food has no sensible relation to 
weight. 


(iii) For a constant age the number of calories in animal food has a positive correlation with 
weight for both girls and boys, being definitely significant in the first case (+°32 4°07) and not 
so in the second (+08 + ‘09). 

(iv) For a constant age the correlations of weight with ratio of expenditure on vegetable and 
animal foods are for both boys and girls quite insignificant as compared with their probable 
errors. 

I am extremely obliged to Dr Chalmers for doing his best to supply additional material. As 
far as it goes, it tends to show that calories are of far more importance than expenditures, but 
that calories from animal food are more closely related to physique than are calories from 
vegetable food*. The new material supports my criticisms that the failure to distinguish 
between animal and vegetable calories stultitied the advice given by Miss Lindsay, i.e. to spend 
money on oatmeal rather than on eggs. It also indicates that no safe conclusions with regard 
to dietaries can be drawn until a reasonable anthropometric survey accompanies the record 
of dietaries, and the whole is reduced with adequate statistical knowledge. 


One point I can allow Professor Paton. It was an oversight on my part, when I said that 
I had written to both Miss Lindsay and to himself; the letters in which Miss Lindsay and he 
stated that to follow up the families now would be impossible were both replies to one and the 
same letter of mine addressed to Miss Lindsay. The additional facts I desired were in their 
opinion unascertainable, and further correspondence did not seem to me likely to be of any 
service in achieving the end I had in view, namely to render of real service to science a piece of 
recording work from which in my opinion then and in my opinion still, very misleading conclu- 
sions had been drawn, and which conclusions in their turn had been exaggerated in the press 
résumés of the paper. I do not think any such work as that done on dietaries by Miss Lindsay 
and Professor Noel Paton will be of real value until (i) these dietaries are accompanied by 
a thorough anthropometric survey of the whole families of the dieted and (ii) the equality of 
animal and vegetable food calories ceases to be considered as a dogmatic truth. 


* Of course the results show that on such data as are available, the food has relatively little relation 
to the weight, there is no ‘‘marked ” relationship. 


Miscellanea 175 


III. Note on the essential Conditions that a Population breeding at 
random should be in a Stable State. 


By K. PEARSON, F.R.S. 


Let us deal with bi-parental inheritance in the first place. Let « be a character in the father, 
mean 2, standard deviation o,; let y be the same character in the mother, 7 its mean, and gy» its 
standard deviation. Let z be the character in offspring of one sex, o3 be the standard deviation of 
all offspring of this sex and Z the mean. Let: pu’, jus’, pag’ 3 oe” os”) pea” 3 aN poe!” pag’, pa’”, be the 
moment coefficients about the means respectively of father, mother and offspring frequency distri- 
butions. Let 7,, be the mean of the offspring of those parents, who have characters w and y, and 
let the array of frequency of such offspring be given by fs (wv) du about 2,,, i.e. the character of any 
offspring in this array is 7,,+4, where uw is independent of the parental characters x and y, but 
Z,, is a function of w and y the parental characters. Some writers have suggested that the 
offspring character should be taken as a blend of the parental characters, i.e. 


z=}(et+y), 
understanding by blend the mean of the parental characters. This appears to be very unsatis- 
factory for: 


(a) It supposes the parental characters to fix absolutely the offspring characters which is far 
from a result of experience. 


(6) It supposes the mother to reproduce the female size of character in the male and the 
female offspring alike, whereas she contributes to each the sex character of her own stock, ice. if 
she is a tall woman, she would contribute absolutely more to a son than to a daughter. The late 
Sir Francis Galton got over this difficulty by “reducing female measures to their male equiva- 
lents.” This he did by altering absolute measurements in the ratio of male to female mean 
measurements. Thus he would take for the mean of his array of offspring 


a ih oa @ 
2ay =F ange 


if he were dealing with male offspring. A more reasonable hypothesis is to assume that 


This will practically agree with Sir Francis’s form, if the coefficients of variation in the two sexes 
are the same, i.e. 01/%=09/Y. 


If we measure wv from the mean of the array of offspring we have 


ee eyed, Ls 
Z=503 @ + a) etuehelesmmteh alo stasntiasielastioe seraranetnarins arene eb (11). 
We shall now suppose the offspring to follow the law (i), or 
poet = +4 =") 4 ee Gan 
1 2 


where w and y are uncorrelated (mating at random), and w represents other influences than the 
parental, and is therefore uncorrelated with # and y*. The frequency distributions of # and y 


* This assumes the homoscedasticity of the arrays of offspring due to pairs of fathers and mothers 
with characters « and y, 


176 Miscellanea 
may be taken as given by fi (v—2#) and fy(y—y). Let V,x WV, be the total number of possible 
matings ad w 
=SJi(%-2)foy-Z) dedy 
and the total number of offspring V3 in any array 
=f fs; (u) du. 


I now propose to give the expression for the zth moment coefficient about the mean, i.e. py”, 
of the population of offspring of a given sex. We have 


NV, x Nox N3x on”= |] [2 o3” co 2 =!) + uf ve (w— 2x) fo (y—Y) x fz (u) dadydu, 


the integration being extended over the whole of the frequency distributions of father, mother 
and oftspring. Thus 


t=n=s Jn—s ‘L(y —%)}n-8— BOON us 
pal” aa Hos res = = 8 iB feet ea oy" 8- tot os 


x fi (v7 — 2) fo (y— ¥) fs (u) dedy du. 


Now «, y and wu being independent we have 


1 = ‘ = 
iA | (w- EB) a (x az @) Au = pn —3- t 


: Y ; ” ” 
al Yy-¥)' hly-y) dy= pi 
: 8 fi lv 
N; | us fs (uw) du=pyg 
= g t=n— n-s 4 
is fey = 03" i. - 3 + | Pin-s—t Ht bet }] eer sonaes (iv). 


s=0 gr-3 |n—s|s t=0 jn- s—t|toy"- St gat os8 


Thus we reach, remembering that py! = py" =p," =0, 


i. 1 at bs ” 
Pe 4 om (, + Hs, + any Synveraisiesayefolstaseleleqelstctouncajatesalerecciotetenttats(s(e{etetctahsfa¥uialetstetalstetstetatetetl Rete ieeis (Vv), 
Hs = 3 o3° (# aF e) ipeall “ad aantearearoealemenitete te coduchiae ua ee ss: ete ae meee eee eee (v1), 
Hl d 2 is 5 if Des. ! 
Ha = 76 os! (A+0% et) +5 oy (PE +8 ) Hal ee oh heNeeeeere (vii) 


But po’ =077, pe” =o", and py” =o". Hence we must have 
pl? =4o52 a a'o{bisieiolelatafe, a¥a\s otesa(e ajalejs\etarele(aoleleiate,sie7= slelelnfsle(erafatsl stale (vill). 


If as usual we take 8, = p37/u.? and B,=4/p? we find from (vi) and (vii), writing s?= po'¥ 


/B rl ) {Va —3 Ve, ne Seen ERR acke ta inaah is (ix), 
4 
Byit = {2 uy = 74 (8! 4+ 8,” +6) — 32} REE ERR EES ER 0c (x) 
Whence by the use of (viii) 
/By¥ =2,/2 va” -5 war-+V,} ve ae ted! ik eae (xi), 


: Jer ac 
gVv=4 {182” — 16 (Bo + By! + 6) SIO. ai sosine satin ss heater cee (xii). 


Miscellanea 177 


Hence in order that the offspring population should be stable, it is needful that in the array of 
offspring for given parents : 


1 
(a) FTI) 03. 
(6) Nama, (Bi - 5 (B+ /B7)| =2y2/8,” (1-7) = 3 Wa” 


if 8,” =8,/=B,", i.e. the skewness be the same for fathers, mothers and offspring. 
i 1 mr 
B=; (78)" —15), 
if pee = Bo! =f," 
Thus, we have for the array of offspring of given parents 


1 
i op 03 


enor ae 
B= 5B, tones outta sae esac e caaioe ees ceavenges (xili). 


Peal, 
Be” —8= 5 (Br — 3) 


Accordingly the variability of the array is less than that of the population of offspring ; and 
the array (unless B,;'”=0, 8;’”=3) is more skew and has greater kurtosis than the general 
population. 

If 712, 723, 73, be the three correlations of father, mother and offspring we know that the mean 
standard-deviation of the offspring of arrays having the same parents is 


we 1 = 743 — 793" — 149? + 2712713731 
s'=05 tony cy Cee 
AY 


and this equals if there be no assortative mating 
ae epee 
(712=0), o3N 1 143? — 193°. 


If we could assume this equal to s we must have, since 


y 1 
=—- OG: 
a/ 2 39 
1 ls prepa aa S 
2 =N1—- 713° — 193"5 
2 
leading to 1132 +723°=4, 


or if the two parental correlations are equal to 
113 =123 = "9. 
In other words, if the parental influences were equal and there were no assortative mating 
and the character in the array of offspring had the mean value 


then the population could only be stable if 
713 =723=0°5. 
But this apparently noteworthy result only begs the question. By the general theory of 
correlation the mean of the array of offspring is 


ame 713712723 U— & 4 V3 —N12713 YY 
Lis? Vox 1-732 oy)” 


if there be no assortative mating, 


Cn? z 1138, 1938 
03 (rm te ros 3 = 103 (= ae 
oj 2 C1 a2 


Biometrika x 23 


178 Miscellanea 


Hence if we asswme the mean of array of offspring to be given by 


VW y 
aa (Z +2) 


(i) the second portion of the expression must be zero, i.e. mean of whole population of 
offspring must coincide with mean of array of offspring where parents have the mean values and 
(ii) we must have 73=723=%. In other words the form of our assumption involves both the 
equal influence of the parents and the value of the parental correlation. 


From the standpoint of heredity no such assumption is legitimate. Neither in Mendelian 
theory nor in biometric formula, nor again in actual observation is it permissible to suppose that 
the mean of the array of offspring is determined solely by the parents. Still less is it possible 
to suppose the actual character of the offspring to be the mean of that of the parents (i.e. put 
u=0). Ifit were we should have z=4(a+y), whence flow 


mr 1 ’ ” 
(Doan (12 + #2") 


m” | , ” 1 
Bs = (3 + ps3 ) tie e/6iaiulole ae elnie)sieje eisieieis.eiais/s sie jslelelelsisiainre (xiv). 


i 1 , , ” ” 
ba 76 (ua + Bpy me’ +4") 

But these equations assume that py, w3'Y and py” are all zero—an absurdity in itself and 
contrary to all experience, whether biometric or Mendelian. For non-assortative mating and 
equal potency of parents, they lead to parental correlations of the order °7 and to an impossibility 
of stability in any population*. 


In fact any such relations as (xiv) are inconceivable on the basis of both biometric as well as 
Mendelian theory and observation. Parental correlations have never been observed anywhere 
near such a value as 0°7. Equations (xiii) are, however, suggestive ; they show that if the 
parental distribution be symmetrical and mesokurtic, the array of offspring will remain so after 
selection; but if the parental distribution does not possess these characters, then any selection of 
individual parents will emphasize the asymmetry and the kurtosis in the resulting array of 
offspring ; or continued selection of this type will lead to greater and greater divergence from the 
normal or Gaussian frequency distribution. 


* If we assume that the mean of the array of offspring of parents of characters x and y is given by 


lx +my, it is only another way of asserting that the regression is linear and that 
_ 712 — 713723 3 _ 713 — 712723 63 


i n= 
1-137 oy’ 


1-193? 9° 


If we make /=m, or give equal weight to the parents, it is only rational to suppose that o;=c2 and 
T12=113, Which lead us to 


N16 O° 
=m=—2 3, 
1+ T93 01 
‘ : ‘12 93 
Hence the mean of the array is —— —(#+y), 
1+793 04 : 


and whether we make x constant and y constant or x+y constant leads to precisely the same variability 


in the array, i.e. 
: 1 = 19? — 1137 — 17932 + 27121713723 Qryo2 
s$=03 > ~=03 1-_—_, 
1 - 723" 1+793 


If assortative mating be zero, this equals 


03 VI = Qo? 
and, if to reach the results for u’” given above we put this zero, we must have 


r= N50=0°7 nearly. 


Miscellanea 179 


IV. The Hlimination of Spurious Correlation due to position in Time 
or Space. 


By “STUDENT.” 


In the Journal of the Royal Statistical Society for 1905*, p. 696, appeared a paper by 
R. H. Hooker giving a method of determining the correlation of variations from the “in- 
stantaneous mean” by correlating corresponding differences between successive values. This 
method was invented to deal with the many statistics which give the successive annual values 
of vital or commercial variables; these values are generally subject to large secular variations, 
sometimes periodic, sometimes uniform, sometimes accelerated, which would lead to altogether 
misleading values were the correlation to be taken between the figures as they stand. 

Since Mr Hooker published his paper, the method has been in constant use among those who 
have to deal statistically with economic or social problems, and helps to show whether, for 
example, there really 7s a close connection between the female cancer death rate and the quantity 
of imported apples consumed per head! 

Prof. Pearson, however, has pointed out to me that the method is only valid when the 
connection between the variables and time is linear, and the following note is an effort to extend 
Mr Hooker’s method so as to make it applicable in a rather more general way. 

If 21, @, #3, ete., W715 Y2, ¥3, etc., be corresponding values of the variables 7 and y, then if 
X, Ly, Xz, ete, Y1, Yo, Y3, etc. are randomly distributed in time and space, it is easy to show that 
the correlation between the corresponding th differences is the same as that between « and y. 

Let ,,D, be the nth difference. 

For 1D, = 21 — Vo, reer 1D,? = ar? = 24 Wy +209, 

Summing for all values and dividing by V and remembering that since 7, and «2 are mutually 
random S (2, #2) =0, we gett 


Again, Dy= I - Yoyo Dar Dy = "19, — L211 — M1 Yot L272. 
Summing for all values and dividing by NV, and remembering that 7, and y. and a, and y, are 
mutually random 
FDigDy Dp {Dy 2ey Fx Fy 


peie/ Tenis 
DD, ey, 


Proceeding successivel ? =? SNe SSAA oe Saisncuieeecnice Bata eters asia tons ID 
2 y ne Dy, rest OF n-1Dy ad ( ) 


“rie 
Now suppose 2), 22, #3, etc. are not random in space or time; the problems arising from 
correlation due to successive positions in space are exactly similar to those due to successive 
occurrence in time, but as they are to some extent complicated by the second dimension, it is 
perhaps simpler to consider correlation due to time. 
Suppose then v= X,4+bt,+ct?+dtitete., «,= Vo+bto+cty?+dt3+ete, 
where X,, X,, etc. are independent of time and ¢,, ¢,, t3 are successive values of time, so that 
t, —tp_4= 7, and suppose y,= Y,+ 04, +¢t,?+ ete. as before. 


* The method had been used by Miss Cave in Proc. Roy. Soc. Vol. Lxxiv. pp. 407 et seq. that is in 
1904, but being used incidentally in the course of a paper it attracted less attention than Hooker’s 
paper which was devoted to describing the method. The papers were no doubt quite independent. 

+ The assumption made is that n is sufficiently large to justify the relations 

S4"71 (x) /(n — 1) = Sy” (x) (mn — 1) = Sy" (x)/n_ and S{"—! (a?)/(n — 1) = Sy” (x?) /(n — 1) = Sy" (x?) /n, 


being taken to hold. 
232 


180 Miscellanea 


Then 1D, = Dy -bT — eT (t, + te) — dT (t?7 + tte 4%”) — ete. 
1D, =,Dx—{bT+cT? +dT3 + ete.} — & {2¢7'+3dT? + 4eT + ete.} 
—t,° {3dT + 6eT? + etc.} — ete. 
In this series the coefficients of ¢,, t2, etc. are all constants and the highest power of 4, is one 
lower than before, so that by repeating the process again and again we can eliminate ¢ from the 
variable on the right-hand side, provided of course that the series ends at some power of ¢. 
When this has been done, we get 
nDx=nDy +a constant, 
nDy=Dy +a constant, 
BO Hei apy = Ds »D % => lxy) 


‘ 20) wd . =r Pn oo » (0) 5 rayle Ss 7 
and of course PD ety =e UD DY for ,D, and ,D, are now random variables independent 


zn 
of time. 

Hence if we wish to eliminate variability due to position in time or space and to determine 
whether there is any correlation between the residual variations, all that has to be done is to 
correlate the Ist, 2nd, 3rd...2th differences between successive values of our variable with the 
Ist, 2nd, 3rd...xth differences between successive values of the other variable. When the cor- 
relation between the two rth differences is equal to that between the two (n+1)th differences, 
this value gives the correlation required. 

This process is tedious in the extreme, but that it may sometimes be necessary is illustrated 
by the following examples: the figures from which the first two are taken were very kindly 
supplied to me by Mr E. G. Peake, who had been using them in preparing his paper “The 
Application of the Statistical Method to the Bankers’ Problem” in The Bankers’ Magazine (July— 
August, 1912). The material for the next is taken from a paper in The Journal of Agricultural 
Science by Hall and Mercer, on the error of field trials, and are the yields of wheat and straw on 
500 345 acre plots into which an acre of wheat was divided at harvest. The remainder are from 
the three Registrar-Generals’ returns. 


I Il Ill IV V VI 
Correlation between ... Sauerbeck’s | Marriage} Yield of Tuberculosis Death Rate. | 
Index numbers. Rate Grain 
Infantile Mortality 
and wits ... | Bankers’ Clear- | Wages | Yield of 5 
ing House Straw | — 
ULI ees Ireland England | Scotland 
head 
Raw figures ss — °33 — 52 +°753 3 "35 +02 
First difference... +°51 + ‘67 +590 +°75 + °69 +°51 
Second difference ... + °30 +°58 + °539 74 ‘74 +°65 
Third difference ... + :07 + °52 +°530 — — — 
Fourth difference ... +11 + 55 + °524 — — —_ 
Fifth difference ... + 05 + °58 — — — — 
Sixth difference... — +°55 = = — — 
Number of cases | 41 years 57 years Ly 42 years 
ae eed years! plots Sa) ae 


The difference between I and II is very marked, and would seem to indicate that the causal 
connection between index numbers and Bankers’ clearing house rates is not altogether of the 
same kind as that between marriage rate and wages, though all four variables are commonly 
taken as indications of the short period trade wave. 1 had hoped to investigate this subject 
more thoroughly before publishing this note, but lack of time has made this impossible. 


Miscellanea 181 


V. On certain Errors with regard to Multiple Correlation occasionally 
made by those who have not adequately studied this Subject. 


By KARL PEARSON, F.R.S. 


(1) Iv is well-known* that if we endeavour to predict the value of a variate xv) from a 
correlated variates 21, 2, ... Y,, by determining a linear function of #7, #2, ... v, which has 
the maximum correlation #, with wz, then the value of #,,? is given by 

: 
‘ h,?=1—A/Ago, 
where A is the determinant 


— 1 5) TOL) M025 eee Ton 


Pity 0 5) Ry oon Pate | 


| Troy Tnty nds oe 1 | 
and A,, is the minor corresponding to the constituent of the pth column and gth row. 


The system I propose to consider is that in which all correlations like 7, are equal, whatever 
p be, toa constant p, and all correlations 7,,,, where p and g may take any values from 1 to n, 
are the same and equal to «. We now have for the value of A the expression 


Wl Pye (Ps) ees Ps | 


| 
To evaluate this determinant add all the rows but the first together, giving 
nap, l+(n—l)e, 1+(n—l)e, ... 14+(m—-l)6, 


multiply the result by p/(1+(2—-1) €) and subtract from the first row. We have 


np* ; | np? 

= —— OFF Oe OF — aa 0. 

l+(n—le’ ’ , | {1 11s x Aon 
Ps l, ¢, || 
Ps €, 1, € | 
BNajoidie sraseuiscia gee aneicstesejmee eelicow camer \ 
Ps €, €, 1 | 

ane 2 
Hence fe a (he = i a i 
1c n ( T+(n—l)e AGEING Too eer (a), 


/ n - 
or R,t=p Tes (eat hPa wetiecees (ii). 


* Biometrika, Vol. vit. p. 439. 
+ The sign of R, must be determined from other considerations. 


182 Miscellanea 


Thus if 7 variates are equally correlated (e) among themselves, and equally correlated (p) with 
another variable, we shall not indefinitely increase the accuracy with which the last variable will 
be predicted from the others by increasing indefinitely the number of the variates 2. 


illustration. The coefficient of multiple correlation is required as we increase the number of 
brothers from whom a prediction of a character in a given brother is made. The fraternal 
correlation =°5, 


Number of Brothers R, 
1 “5000 
2 “5774 
3 “6124 
4 “6325 
5 6455 
6 6547 
10 ‘6742 
a “7071 


Compare against these results ¢wo parents only in a population where there is no assortative 
mating and the parental correlation="5. Here e=0, p="5 and n=2, .-. R=4$,/2=°7071, or two 
parents will give more information than 10 brothers and sisters, and as much in fact as an 
indefinite number. Suppose the parents tend to select their like, i.e. suppose there is assor- 
tative mating in the population, say, e=+15, then with the same intensity of parental correlation 


2 = 6594, 
or, two parents will give us more information than six brothers and sisters. 


Now this illustration brings out the real nature of the effect of increasing the number of 
variables from which we predict. Such increase has very little value, if those variables are 
fairly highly correlated with each other. To be effective they must be highly correlated with 
the variate we wish to predict and correlated very slightly with each other. 


Even in this case there is a limit to the degree of correlation reached when the number of 
variates is indefinitely increased, namely p/,/e, and it is clear that if p be small and e fairly large, 
no very great increase of correlation is obtained if we use an indefinitely great number of variates. 
For example if p='05 and «=°5, we find &, =-0707 only. Even if p were ‘10, we should only 
raise R to ‘1414, could we predict from an indefinitely large number of such correlated variates*. 
Indeed as long as ¢ is not less than p we gain singularly little by combining large numbers of 
variates, For example if p were ‘4, and e=°4 ten such variates would only raise the correlation to 
5898, and an indefinitely large number to ‘6325, which is less than double the single correlation. 
Yet there are apparently many persons who believe that by taking a number of low correlations, 
a high relationship can be reached ! 

Actually there is a limit.to what relations can possibly exist between a variate xv and a series 
of equally correlated variables x, ... 7,. Since & must be less than unity, we have 


py aes 
PON) SV Gielen ats 


2 
np* —1 
or e> f ° 
n—-1 
Thus if 72=10 and p=‘5, « must be >'1667. Or, it would be impossible for 10 variates to 
have a correlation ‘5 with another variable, and a zero correlation with each other. 


* Even if p were ‘10 and ¢ as low as ‘10 we should not raise R for endless variates of this order of 
correlation above -3163, while from compounding ten such variates we should only obtain a correlation 
about double that of a single variate, i.e. R=+2294, 


Miscellanea 183 
If we suppose a number of variates » to be uncorrelated with each other, but correlated 
115 102) «++ “on With another variable w), then we have from the determinant as given below 


2 cee + 2 
==i(l » Tot» To2> e%* Von =(1 — 749 — lo Sesicsaee UI ) Dou. 


iy bes 0) 5 oon 0) 


eevee eee eee eee) 


, ‘ 5 9 
eae: he= More+ Poy? + seeita Tony 
or R=Jn ih Myo tit FT on" 


Therefore, if 2 variables, uncorrelated among themselves, be correlated with an additional 
variable, it is necessary that the root mean square of their correlations should be less than 


55 We see therefore that it must either be impossible to find a large number of variables 
n 


uncorrelated among themselves, which are correlated with an additional variable, or else their 
correlations with this variable must be extremely low. The last result shows us the fallacy 
of supposing that correlations are simply added together for a combined effect ; clearly when 
the variates are uncorrelated among themselves, we add by the sum of the squares. For 
example, if 7) =792=...-=70n='03 one hundred such variables would only raise R to 30, On 
the other hand if the variates are highly correlated together, say e=°81, an indefinitely great 
number of such variables would only raise the multiple correlation to ‘0333, if the individual 
correlation were ‘0300. 


We are now in a position to apply our results to the problem of the relative intensity of 
heredity and environment. This problem has been singularly misunderstood especially by the 
popular exponents of Eugenics. Some illustrations of this may be given here. Major Leonard 
Darwin writes as follows in the Journal of the Eugenics Education Society: “It is impossible 
to compare heredity as a whole with environment as a whole as far as their effects are 
concerned ; for no living being can exist for a moment without either of them*. Moreover, 
in order to compare two things so as to be able to use the words more or less in connection 
with such a comparison, we must have a common unit of measurement applicable to them 
both. But what is the unit by which both heredity and environment may be measured ? 
I myself have no idea. May we not be discussing questions as illogical as enquiring what 
portion of the area of a rectangle is due to its width and what to its length? Js 7t ever wise 
to use words in scientific literature without endeavouring to attach a definite meaning to themt ? 


It is hard to conceive a paragraph of the same length more full of evidence of complete 
ignorance of the methods used in modern science for comparing correlated variates! Yet it 
goes out as the opinion of the President of a Society which is endeavouring to spread the 
scientific doctrines of Eugenics among the people! Major Darwin begins by stating that it is 
needful to have a common unit of measurement in order to compare two variates. To begin 
with we are not comparing two things, but we are comparing the influence of two things on 


* There would in our sense be no heredity if the average child born to noteworthy parents was equal 
to the average child of the whole community. Yet it is perfectly easy to understand how living beings 
could exist under such a law of reproduction. Major Darwin seems to be confusing two things, the fact 
that a man is born true to his species, and the fact that he resembles his immediate ancestry. It is 
the latter fact only which concerns us when we compare heredity and environment, i.e. how variation of 
immediate ancestry affects the individual’s physical or mental characters. But without such heredity 
individuals might quite well exist. 

+ The Eugenics Review, Vol. v. p. 152. The italics are mine. 


184 Miscellanea 


a third, ie. the intensity of a certain environmental influence and the intensity of a certain 
somatic character in the parent, say, on the intensity of the somatic character in the off- 
spring. Yet Major Darwin tells us we cannot do this because we cannot measure these 
things in the same unit !—How suavely yet forcibly Sir Francis Galton himself would have 
ridiculed such ignorance in high places as is passed by the Editor of the Eugenics Journal !— 
We can hear him now telling us how the intensity of each character could be measured by 
its grade, and how the problem turned on whether the same change in grade in the environ- 
ment and in the parental somatic character produced greater or less change in the grade of 
the filial somatic character. When we inquire whether inter-racially stature is more closely 
related to cephalic index or to eye colour, are we to be met by the statement that these 
characters cannot be compared because they cannot be measured in a ‘common unit,’ and 
then be told that it is not “wise to use words in scientific literature without endeavouring to 
attach a definite meaning to them?” Every trained statistician knows that each character 
is measured in the unit of its own variability—in what he terms its standard deviation*, 
and that this standard deviation provides him with a measure of the frequency of each value 
of the variate in question. It seems to me that the only correct sentence in this paragraph, 
is the author’s statement that he himself has no idea what unit is ‘common’ to heredity and 
environment. 


But our author continues : 


“Take any quality, and we find that the human beings composing any community differ 
more or less considerably as regards that quality. Now we can measure the correlation 
between the differences shown in this quality and the differences of environment to which 
the members of the community in question had previously been exposedt. This is one 
correlation. Then we can also measure the correlation coefficient between, say, father and 
son, as regards the quality in question. Here is a second correlation; and if we are told 
that the relative influence of environment and heredity is measured by the ratio between 
these two correlation coefficients, we certainly do thus get a clear conception of what is 


+ 


meant }. 


But has the writer really obtained a clear conception of what such coefficients of correla- 
tion mean, when in the next paragraph he continues : 


“Tmagine an ideal republic, in some respects similar to that designed by Plato, where not 
only were all the children removed from their parents, but where they were all treated exactly 
alike. In these circumstances none of the differences between the adults could have anything 
to do with the differences of environments, and all must be due to some differences in inherent 
factors. In fact the environment correlation coefficient would be nil, whilst the hereditary 
correlation coefficient might be high §.” 


Could any better evidence be adduced that the President of the Eugenics Education Society 
did not know what a coefficient of correlation meant at that date? The coefficient of correlation 
for the environment might be anything from —1 to + 1; the only obvious fact would be that you 
could not find its value, except in the form 0/0, from an environment which precluded any 
measure of variation. How again Sir Francis would have smiled at the notion that the 
coefficient of correlation for a constant environment must be nil. Why should we follow such 


* Of course he may or does need other constants to help in the description of the frequency. 

+ loc. cit. p. 153. 

+ This seems to contradict the writer’s previous assertion that two things are incomparable, if they 
have not a ‘common unit’! 

§ I wrote at once to Major Darwin pointing out the error of such a statement and he withdrew it in 
the next number. But the harm done by an article of this kind cannot be reversed by correcting a 
single misstatement. 


Miscellanea 185 


advice as that given by the President of the Society to avoid as far as possible “such phrases as 
the relative influence of heredity and environment,” when on his own showing he does not in 
the least appreciate the methods by which this relative influence is measured ? 


Then Major Darwin continues : “Surely what we want to know is how we can do most good— 
whether by attending to reforms intended to affect human surroundings, or to reforms intended 
to influence mankind through the agency of heredity. But does this ratio [that of the environ- 
mental and hereditary correlation] give us any sure indication of the relative amount of attention 
which should be paid to these two methods of procedure?” Our only reply can be that these 
correlations certainly do, and that as long as the President of the Eugenics Education Society 
fails to grasp their meaning, he is doing grave harm to the science of eugenics. 

We measure the change in the character of an individual which would be produced by a 
change of a like or an allied character in a parent, such change being one of which we have 
experience ; we measure the change which would be produced in the character of the individual 
by changes in the environment such as we have experience of, i.e. when we move the individual 
from a badly ventilated to a well ventilated house, from a back to back to a through house, from 
a low wage to a high wage, and so forth, and we find the resulting changes are of a wholly 
different order in these cases to what happens when we change the physical characters, the 
health or habits which define the parents. It is on the basis of this that we assert that the relative 
strength of heredity is far greater than the strength of environment. To this reasoning, apart from 
such arguments as the above or those to be immediately dealt with, reply is only made by talk as to 
the impossibility of an individual surviving if you deprived him of his normal environment! It 
would be just as reasonable to assert that everything must be due to heredity, because a race of 
supermen would breed supermen ! What the scientific eugenist has endeavoured to measure are 
the influences of such range of differences in environment as occur in everyday experience and 
are therefore producible from the political, economic and social standpoints, not the absence of 
all environment at all. But while this is recognised by some of the popular eugenic writers, they 
have approached the problem from another standpoint which indicates equally how little they 
grasp modern statistical theory. We admit, they say, that the environmental correlations may 
be of the order ‘03 or :05 and the inheritance correlations of the order 50. But this is the 
correlation of one character in environment. You ought to take ten or twenty, and then you 
will have multiplied up environment to be more effective than heredity, for 03 x 20=-60. In the 
first place we may suggest that it would be just as reasonable, if the argument were a valid one 
to multiply up the favourable hereditary characters, to take weight, height, muscular activity, 
health, intelligence, caution, and many other desirable factors, and these not only in one parent 
but in brothers, sisters, aunts, uncles and grandparents and treat the cross-correlation of these 
with the character under discussion. But although every improvement in stock would reflect 
itself in improvement in offspring, correlations cannot be added together—any more than forces 
by simple arithmetical addition. You do not combine two hereditary correlations any more than 
two environmental correlations by mere addition. You must proceed by the combinatory process 
indicated at the commencement of this paper, which is one of course familiar to every trained 
statistician. : 

Yet here is a statement which the Editor of the Hugenics Review admits to its pages without 
contradiction * ; 

The point that we wish to make is this, In the face of so much ignorance concerning, not only 
heredity itself, but also its complement, the influence of environment, how can any one be justified in 
making sweeping generalisations with reference to these subjects? 

Such generalisations, however, are made. It is said that we have a definite proof that inheritance is 
of far greater strength than environment. This argument takes the following shape. The correlations 
between parent and offspring for a number of features have been calculated, and the mean is found to 


* Vol. v. p. 219, in an article by A. M. Carr-Saunders. 


Biometrika x 24 


186 Miscelianea 


be somewhere about °5. Correlations between individuals and various aspects of their environment have 
also been worked out—as, for instance, mental ability and conditions of clothing, or between myopia and 
the age of learning to read*—and the mean value is found to be about -03. It is then said that the 
mean ‘‘nature value” is at least five to ten times as great as the mean ‘‘nurture value,” and upon this 
is founded the generalisation that ‘‘nature” is of far greater importance than ‘‘nurture’+. It may be 
questioned, however, whether such a comparison does not involve a serious misiake. For if we consider 
the two mean values that are compared, we find that, whereas the ‘‘mean nature value” is the mean 
value of a number of observations, all of which provide a full measure of the strength of heredity, the 
‘mean nurture value” is the mean value of a number of observations, each of which measures only the 
strength of some one isolated aspect of environment. It would appear then that the full strength of 
inheritance has been compared, not with the full strength of environment, but with the average of a 
number of small isolated aspects of the latter. Asa matter of fact it is quite beyond our power at 
present to sum up the full effect of environment upon the individual and compare it with the full effect 
of heredity. We are, therefore, justified in saying that we neither know in particular cases how far the 
environment can produce any effect, nor can we make any definite statement as to the comparative 
strength of ‘‘nature” and “nurture.” 

Now this is the doctrine passed by the Editors of the Hugenics Review, the journal of a 
society, which has assumed the mantle of Francis Galton{, and it is passed, because the 
editorial committee of that society does not grasp the meaning of multiple correlation! The 
passages in italics have been so printed to draw our readers’ attention to them. In the first 
place, of course, a single correlation coefficient does not provide a full measure of the strength 
of heredity. In the table cited the coefficients are those for one parent or for one brother or 
sister. Each relative—and those for independent stocks are either non-correlated or inter- 
correlated very slightly—provides such a coefficient, and further each character in such relatives 
may be correlated with the character under discussion in the subject in question. In the; next 
place the environment factors do not consist of “some one isolated aspect of environment.” 
All these factors or aspects are closely interlinked, and this was a fact well-known to the 
workers in the Galton Laboratory. The real interpretation of such a difference as 560 and °03 
in the average values of single coefficients can only be appreciated by those who are conversant 
with the theory of multiple correlation, and it is quite clear that those who profess to guide the 
public in this very difficult problem—which is essentially a scientific problem—lack any adequate 
knowledge of the sole instrument by which any conclusion can be drawn. : 

The writer appears to be wholly ignorant of the nature of multiple correlation in the first 
place, and in the second entirely to overlook the very high correlations which exist between 
environmental factors. Bad wages, bad habits, bad housing, uncleanliness, insanitary sur- 
roundings, crowded rooms, danger of infection, etc., etc. are all closely associated together, 
and while the order of correlation between environmental and physical characters is low, that 
between individual environmental factors is in our experience very high. Thus the problem of 
multiple correlation illustrates closely the theory developed in the first part of this note; we 
have to deal with a low p and a high «. 


For example, if we take the environmental factors to have an average inter-correlation of °70, 
then an infinity of such factors for a mean environmental and individual correlation of ‘03 would 


* As the writer phrases this correlation, it is very liable to be misinterpreted. What the Galton 
Laboratory did was to show that myopia was very markedly inherited, and that the theory that it was 
largely due to school environment was incorrect, because children who began to read late, i.e. went late 
to school, were not less myopic than those who went early. 

+ Karl Pearson, Nature and Nurture, Eugenics Laboratory, Lectures vi. p. 25, 

+ If there was one point on which Francis Galton felt strongly and wrote it was on this point of the 
relatively great intensity of ‘‘nature” as compared with “nurture.” I do not stand alone in recognising 
it as an essential part of his teaching: ‘I am inclined to agree with Francis Galton,” writes Charles 
Darwin, ‘‘in believing that education and environment produce only a small effect on the mind of 
anyone, and that most of our qualities are innate.” 


Miscellanea 187 


only raise the correlation to 0359 against a s¢ngle parental correlation of *5000; if the correlation 
was ‘05 instead of ‘03, we should have the total possible environmental multiple correlation ‘0598 
as against ‘5000. Even if we raise the average environmental correlation to ‘1 and the inter- 
environmental factor correlation be reduced to °5, the multiple correlation of an infinity of factors 
is only :1414 as against the sengle factor of heredity °5000. Even if we could pick out one 
hundred environmental factors which had no inter-correlations—which experience shows is 
wholly impossible—and each of these independent factors was correlated to the extent of -05 
with the mental or physical characters of an individual they would only just reach the hereditary 
influence of a séngle character in a séngle parent. 

Now let us suppose an absolutely idle case, namely that the environmental factors had the 
same correlation as a parent, i.e. 5, with the character of the individual, and only a correlation 
of °6 with each other, then if we could use an indefinitely great number of such factors the 
multiple correlation would only be ‘5//°6=°6455, while the correlation with two parents, with 
no assortative mating, would be -7071. Even with assortative mating, it suffices to take only 
the four grandparents into account to show that heredity acts in excess of an environmental 
scheme even so preposterous as is suggested above. If we take the parental correlations *50, the 
grandparental ‘25, and those of assortative mating -15, we have for the determinant : 


A=| 1, °50, 50, ‘25, -25, -25, -25 
(60,1; 15). "50; “50, “0; 0 
| 50, ‘15, 1, -0, . 0, ‘50, ‘50 
5, 50, 0, « 2 


i 
25, 0, ‘50, 0, 0, ‘15, 1 


Add together the second and third rows multiplied by 3951, and the fourth, fifth, sixth and 
seventh multiplied by ‘0456 and subtract the result from the first. The first row then becomes 


| 5593, 0, 0, 0, 0, 0, O| 


the others of course remaining the same. 


Hence NOD 9B Anne 
and R?2=1— A/Apy =1—°5590 = 4407. 
Therefore R:=='6639. 


Or together grandparents and parents would influence a man’s character more than an 
5S 
infinity of environmental factors of the same grade of correlation, because the latter factors 
are far more highly correlated together than several of our relatives. 


Actually of course we are dealing with average values; the average value of environmental 
correlation with individual character being in our experience of the order ‘03 to ‘05 and the 
Lod 


inter-environmental factor correlations of the order 5 to ‘7. But these averages enable us to 
appreciate the total effect. 


The doctrine taught by the writers in the Hugenics Review, that we know nothing of the 
relative intensity of environment and heredity and that it is unwise “to use words in scientific 
literature without endeavouring to attach a definite meaning to them” only demonstrate how far 
the Editors of that Journal are removed from any appreciation themselves of modern statistical 
methods. How far the doctrine is removed from the very strong views held on this point by 
Francis Galton, only those who have studied his writings and know how strongly he felt person- 
ally on the subject are in the least competent to appreciate, 


24—2 


188 Miscellanea 


VI. Formulae for the Determination of the Capacity of the Negro 
Skull from External Measurements. 


By L. ISSERLIS, B.A. 


§ 1. Formulae for the determination of the capacity of the human skull from external 
measurements, were obtained by Lee and Pearson*. The material they employed consisted 
of various series of measurements of Bavarian, Aino and Naqada skulls. Measurements of 
Ancient and modern Egyptian and other non-European skulls were employed, chiefly for 
purposes of comparison. The formulae, some of which will be quoted later, were intended 
primarily for the prediction of the capacity of European skulls, from external measurements. 
Doubt has been thrown on several occasions on the applicability of these formulae to the Negro 
skull, one of the reasons alleged being the supposed difference in thickness of the bone of 
European and Negro crania. 

The publication t of the late Dr R. Crewdson Benington’s researches on the negro skull has 
made it possible to obtain similar formulae for negro skulls, and to test how far these can 
be applied to the prediction of the capacity of European skulls and conversely to test the 
applicability of Lee and Pearson’s Equations to the negro skull. 

§ 2. The material is fully described in Dr Benington’s Study. The crania dealt with in 
the present paper are Benington’s series A, B, C. 

A. Congo Crania in the Royal College of Surgeons. These crania provide 46 males and 
and 21 females, as owing to various defects no capacity is available for numbers 25, 38, 48, 54 
among the males and numbers 69, 72, 75, 79, 82, 85 among the females. 

B. Crania from the Gaboon, Group I, brought by Du Chaillu from Fernand Vaz in 1864. 
Of the 50 male and 44 female crania in the series, 2 males (numbers 3 and ?) and 1 female 
(number 2) are defective, leaving 48 male and 43 female crania available. 

C. Crania from the Gaboon, Group LI, brought by Du Chaillu from Fernand Vaz in 1880. 
Two of the 18 males (numbers 12a and 20) and two of the 19 females (numbers 8 and 18) 
are defective. 

Altogether 110 male and 81 female crania have been dealt with. The correlation has been 
calculated of the capacity (C) and the product of the breadth, length and total height (B, Z 
and #), for each group and for the aggregates of 110 male, and of 81 female crania. 

Correlation coefficients have also been calculated for the capacity and breadth, capacity and 
length, and capacity and total height, but for the aggregates of the three groups only. Re- 
gression formulae are given in all cases. It is to be observed that Dr Crewdson Benington’s 
measurements of capacity were taken with mustard seed, packing and measuring glass and 
that the error of measurement or rather his average difference as compared with other workers 
in the Biometric Laboratory was under 10 cm*. 

In comparing the regression formulae obtained here, with those given by Lee and Pearson for 
European and other skulls it must be remembered that in all their formulae except (12) and (18) 
of p. 247 they employed the auricular height and not the total height. In the present paper as 
in Dr Benington’s study H denotes the total height. Lee and Pearson denote this by #’ and 
use # for the auricular height, 

It was not possible here to use the auricular height as it was not available for the whole of 
the Gaboon series B and C. 

* Phil. Trans. Vol. 196, Series A, pp. 225—264. 
+ Biometrika, Vol. vit. Nos. 3 and 4, Dec. 1911. 


Miscellanea 189 


Taking first the male skulls, the mean value of the capacity and the product BLH, their 
standard deviations and the correlations are given in the following table. 


TABLE I. 
Mean capacit | Mean value of : ; in 
in as z BLH incom} | %e 2 em. nae "C, BLU 
46 Congo Skulls... 1344 3303 126°22 282°99 872 
48 Gaboon (1864) ... 1379 3295 108°30 | 230°30 *822 
16 Gaboon (1880) ... 1447 3463 10960 266-42 308 
110 Negro skulls... 1375 3323 120°74 265 °20 "842 
The corresponding regression lines are 
for the 46 Congo ae3 C=‘00038889BLH+ 59 +o SCRE EECCaTE (1), 
n 
48 Gaboon (1864) | C=-0003865BLH+ 105% 4 eee (2), 
n 
16 Gaboon (1880) | 0=-0003323BLH4297 + occ (3), 
vn 
+A 
110 male negro skulls C='0003849BLH+ 96 ao web eneee caseneade (4). 
n 
Lee and Pearson’s corresponding equation for males is 
C= 000266 L BH’ + 594-6* vo. ccscececsscesceccescescesscecenes (P). 


This is not a regression line, but is obtained by method of least squares from the results for 
various races in their table 20. 


The formulae 1—4 can be used to predict the capacity of an individual skull from external 


measurements. The probable errors of the mean were calculated by the formula 0°674490, — 
N 
where # is the number of skulls in the group to which the formula is applied. If we substitute 
in (1)—(4) the mean values of B, LZ, H for the Bavarian male skulls used by Lee and Pearson, 
Viz. : 
B =150°5, 
L =180°, 
H=133'8, 
5 41 
we obtain, from (1), C=14744+—= 
Vn 
np Op C=1471 #82 
Vn 
» (3), c=1506+ © 
Vn 
» 4) v=14964 >, 
Vn 


* Loe, cit, Equation (12). H’=total height. 


190 Miscellanea 


The measured capacities of these German skulls have a mean value of 1503 c.c. a result 
which is in very close agreement with (4) the formula based on 110 skulls. 1508 is the mean 
capacity of 100 skulls so that === 655. Thus the difference between the actual mean capacity 

N10 
of German skulls and the mean capacity estimated by the negro formula is less than 10 cm’. 
although the mean capacity of German male skulls exceeds that of negro males by 


1503 —1375=128 cm’. 
If the above values of B, Z, H are substituted in Lee and Pearson’s formula (P) on p. 4 
we obtain C=1492. 


On the other hand if we substitute the mean values of the dimensions of the 110 male negro 
skulls, B=137, 2=178, H=135 in formula P we obtain C=1400 as compared with the measured 
mean of 1375. 


This is not as good a reconstruction as our formula (4) or as the formulae of Lee and Pearson 
employing auricular height, and is probably due to the fact that P is obtained by the method of 
least squares from 11 means only. 


§ 4. An approximation to the influence of the thickness of the bone of the skull on pre- 
dictions of capacity from external measurements can be obtained by differentiating the equation 


C=hkBLH+const. 
and putting dB=dL=dH=t. 
We obtain dC=k(BL+LH+ HB)t, 


or if we observe that in the equations the constant is comparatively small 


dC 1 1 1 
C= G ry 38 7H) t 
with B=150°5, L=180°6, H=133'8 


de 
1500 — 


t (02) approximately. 
Thus a difference of 10 cm*. in capacity corresponds to a difference of 4mm. in thickness 
which is about 5°/, of the thickness (say 6 mm.) of the human skull. 


We may fairly conclude then, that there is no appreciable difference in the thickness of the 
negro skull as compared with the European. 


§ 4. The female crania yield very similar results. The following is the table for the female 
skulls. 


TABLE II. 
| Mean capacit Mean BLH : : Giana 
in tnt s in cm.3 oD OS om. "cy BLH 
21 Congo Skulls... 1206 2858 LOZ e Se 90 
43 Gaboon (1864) ... 1232 2924 126°7 270°95 8814 
17 Gaboon (1880) .., 1240 2964 97°31 265°8 8560 
81 Negro skulls... 1227 2956 117 255°72 "7668 | 


Miscellanea 191 


The corresponding regression lines are 


21 Congo skulls C= 0003645 BLH+164+ E> Maseeameneace these (5), 
Jn 

43 Gaboon (1864) | C=-0004122BLH+ 27+ aw Teen, (6), 
Vin 

17 Gaboon (1880) C= 0003134 BLH4+311+ = sea ea Sessatiente teenies (7), 
vn 

81 Negro skulls C= 0003508 BLH 42044 ieee (8). 
vn 


The corresponding Lee and Pearson formula obtained by the method of least: squares is 
C2 OO0USC LBA 7812 cciccceccsscscovccessezsssesesccescenss (Q). 
The mean values of B, L, H’ for the Bavarian female skulls discussed by Lee and Pearson are 
B=144-11, 
L =173'59, 
Hf’ =128°07. 
With these values, we deduce from 5—8 the following values for C. 
45 
(5) C=1331+ 


ww 
(6) C=1347+ = 
vn 


(7) C=13154— 


The mean of the measured values of the capacities of these skulls is 1337 and formula (8) 
based on 81 negro skulls gives a result in very close agreement. 

If the above values of B, L, H’ are substituted in Lee and Pearson’s formula @ we obtain 
C'=1284 a result which differs from the true value much more seriously than the prediction by 
the negro regression formula. 

Again, if we insert the mean values 

B =130°75, 

£ =171°38, 

H=129°'81, 
of the 81 female negro crania in the formula Q we get C=1266 as against the mean of the 
measured values which is C=1227, demonstrating again the fact that the formulae P, @ based 
on 11 means are not as good as the regression formulae. 


§ 5. We add tables of the correlation between capacity and breadth, capacity and length, 
and capacity and total height for the 110 male and the 81 female skulls, and for comparison 
reprint the corresponding value for German (Bavarian) skulls. 


Miscellanea 


TABLE III. 
Correlation Males. 
Negro German 

Capacity and Breadth 4977 6720 
Capacity and Height 6080 (total height) 2431 (auricular height) 
Capacity and Length 7433 5152 

TABLE IV. 

Females. 
Negro German 

Capacity and Breadth 7578 | “7068 
Capacity and Height 5450 (total height) | 4512 (auricular height) 
Capacity and Length 6699 | ‘6873 


The corresponding regression lines are given in the tables below : 


TABLE V. 
Males. 
Negro German 
(9) C=12°6356B-—3561+202 | C=13-432B—517°34 
Vn 
(10) C@=12-8301Z— 1087 + = C=9'892L — 289-55 
7 
(11) C@=15°3265H’— 694 + > C=5:264H + 86805 (auricular height) 
nN 
(H'=total height) 
TABLE VI. 
Females. 
Negro German 
(12) @=17°872B-1114 +7 C=15-716B — 927-66 
nN 
87 
(13) C=12°46 L- Oe ae C=12°055L —755°53 
nN 
98 
(14) C=10°871H’— 184+ J, | C=10993H+ 8213 (auricular height) 


(H’ =total height) 


Se =r 
5 ‘ 
4 
7 
wu 
at 
Pes. 
a Ps fed 
; : 
j 
i 
; : 
- ay ra 
: : 
: 2 
7 ‘ 
1 a 
ie 
7 ' 
: : 
2 t - 
1 
=. ae 
1h 
A ¥ 
i 
_ 
a 
: _ 
1 


= 


; 
: 
= 
7 
om 
- * 
: 
: 
* 
got 


Biometrika, Vol. X, Part | Plate X 


Dr Maynard’s Piebald Negro 


Miscellanea 193 


No great degree of accuracy can be expected in reconstructing the capacity of a skull from a 
single measurement, but the remarkable difference of formula (11) for negro skulls from the 
corresponding German formula is of course due to their referring to different measurements 
of the height. If we insert H=133°8 in (11), which is the mean total height of the Bavarian 


96. 
skulls we get C=1356'7 + —= instead of the measured mean C=1503 cm’. 


Ne 
sos : ; 105. : F : 
Similarly equation (9) gives C=1555°6 + a instead of 1503 when we insert the German 
n 
mean B=150°5. 


Thus 9—14 are of little use for our purpose. 


VII. Note on a Negro Piebald. (C. D. MAYNARD.) 


THE remarkably interesting photograph of a negro piebald on Plate X has been forwarded 
to the Editor by Dr C. D. Maynard. The native comes from the district round Chai Chai. 
Dr Maynard writes from Ressano Garcia, and states that the hospital attendant took the 
photograph. The extraordinary interest of the case arises from the fact that the thighs and 
feet are of normal negro pigmentation, but in the other patches we have varying degrees of 
pigmentation of the skin down to albinotic white. Unfortunately there is no dorsal view, but 
the back is stated to be also affected with albinotic areas. The boy reported that he was 
in the same condition when born, and that the nature and areas of the pigmentation had not 
altered. 


VIII. Note on Infantile Mortality and Employment of Women, from 
the Report on Condition of Woman and Child Wage-earners in the United 
States, Volume XIII. Infant Mortality and its Relation to the Employment 
of Mothers. 

By ETHEL M. ELDERTON. 


THe author of this Report emphasizes the difficulty of determining the effect of women’s 
employment and points out that 


‘It would be possible to draw positive conclusions as to the relative importance of this particular 
factor only by point-to-point comparison of the infant mortality for a period of years in two large 
communities, or two classes of large communities, in which all the material conditions were sub- 
stantially common, with the single important exception that in one a considerable proportion of the 
married female population of child-bearing age were at work outside of their homes and in the other 
community with which the comparison was made none of the women were so employed. 


To admit of entirely sound conclusions, it would be necessary that the populations-—and especially 
the women—of both communities should be of like ages, races, and physical health, that their living 
conditions should be practically identical, and that, in a general way, the child-bearing women should 
be of about the same grade of intelligence....... In default of some such comparison on a broad scale of 
the mortality of the infants of working and non-working women of similar ages, races, intelligence, and 
living conditions, no one can determine accurately how many of the deaths of working women’s infants 
are due to the mother’s work and how many to the other conditions of their lives and environment.”’ 
(p. 18). 


The author illustrates the point by taking the six New England States and giving the infant 
deathrate, percentage of women of 16 years and over who are breadwinners, percentage of foreign- 
born to the population and percentage of population living in towns of 4000 and more inhabitants, 
and showing that, though the states with the highest infant mortality have also the largest 


Biometrika x 25 


194 Miscellanea 


number of women employed, they have also the largest percentage of foreign-born and of those 
living in urban surroundings, and that it is therefore impossible without further investigation to 
assign the infant deathrate to any of these three factors. 


A further investigation has been undertaken into the 32 Massachusetts cities and the death- 
rate under a year is given, the percentage of foreign-born, the births per 1000 of the population*, 
the percentage of women gainfully employed and the percentage illiterate, and a comparison is made 
between the ten cities with the highest and the ten cities with the lowest infant deathrate and 
percentage of women employed and the other factors enumerated. The conclusion is reached that 
“These comparisons indicate, superficially at least, that a more direct relation exists between 
infant mortality and the birthrate, the percentage of foreign-born, and the percentage of female 
illiteracy than between infant mortality and the employment of women.” (p. 38). 


There can be no doubt that a direct study of the infant mortality in relation to women’s 
employment can only properly be made, when we confine our attention to women, employed and 
unemployed, who are actually mothers and live in the same town, and when we correct for aget, 
and if possible home conditions. Still if we take a series of different towns the right method 
must be to correct by the method of partial correlation for such divergent factors as we are 
able to ascertain and allow for in the series of towns investigated. I have endeavoured to apply 
modern statistical methods to the data of this Report, taking as measures of the environmental 
conditions in the towns: D the general deathrate, 7= percentage of illiteracy, f= percentage of 
foreign-born population, e=percentage of females employed 10 years of age and upwards (note, 
not percentuge of employed mothers, so we may be largely measuring effect of child labour on 
future motherhood), and @=deaths under one year per 1000 births. Then we have for cor- 
relations : 

Tc = 68, Ta= “70, ldf= 74. 
Hence numbers of foreign-born and of illiterate appear to be slightly more influential on infantile 
mortality than employment of women. These values are certainly high and the first is the sort 
of crude value which is used as an argument against the employment of women. Proceeding to 
partial correlations we have 

Pac= 36, a= “43, oS 48, 

Ct ed "42, eas 57, Fide “Bl. 
We next corrected for two factors and found : 

iflde= “34, ef a= 12, i af= 43, 


Thus we see that illiteracy has least influence on the infantile deathrate and the presence of 
foreign-born most. 


But even the presence of foreign-born and of illiterates is not a very complete measure of 
environmental effects liable to influence the infantile mortality in different towns as apart from 
employment of women. Many women employed means industrial conditions and possibly 
generally bad environment. I have taken as a measure of this the general deathrate D and find 

TDa= 71, 'De= ‘47, TDi= ‘60, Di “49, 
Whence I find : 
prac='97, pra="62, pras="75, 
D'fe= “49, D'i= 61, Dit 68, 


showing very substantial relations after correction for a general measure of poor environment. 


* The author is not very confident of the full accuracy of the complete registration of births. 
+ Young women are often employed up to the birth of their first one or two children, but the death- 
rate of these elder-born is heavier than the deathrate of those who immediately follow. 


Miscellanea 195 


Next proceeding to allow for two factors we find 
pDra="23,  pora="44, ppl ae = "35, 


the latter result shows that general deathrate and illiteracy are about equally influential on the 
relation of employment of women to infantile mortality. Finally I corrected for all three factors 
and found : 

isDl de = "28 


or 60 °/, of the crude correlation 72,='68 is due to women being most employed in towns where 
the general deathrate is high, where illiterates are frequent and the population is largely foreign- 
born. How much further the relationship would be reduced, could we equalise other features of 
these Massachusetts cities, it is not possible to predict. The examination of the individuals im 
one city appears to me to be the only satisfactory method of disentangling the numerous factors 
which influence infant mortality. We commend, however, the study of the first part of this 
Report, as it deals very clearly with the difficulties which arise, and will counteract the tendency, 
which is prevalent, to assert causation whenever association is observed. The author lays stress 
on avoiding such logical confusions. 


Part Il of the Report deals with infant mortality and its relation to the employment of 
mothers in Fall River, Massachusetts. In 1908 the attempt was made to visit the homes of 
each of the mothers of the 859 infants who died during the year and to ascertain details con- 
cerning her occupation, ete. In 279 cases the family could not be traced. In 266 cases prior to 
the birth of the child the mother was at work outside the home while in 314 cases the mother’s 
work was limited to household duties or other work carried on entirely at home. Thus only the 
cases of deaths are dealt with and the causes of death are compared in the two groups of cases 
(1) when the mother was at work outside the home prior to the birth of the child and (2) when 
the mother’s work was carried on entirely in the home. 


I hold that this method will never prove as satisfactory as that employed in districts in 
England ; in England certain districts are chosen and every baby within that area is visited 
and the deathrate per number born in one group can be compared with another and the 
circumstances surrounding those babies who survive and those who die in the first year of life 
in a given district can be analysed. 


I do not think that the fact that a rather higher percentage of all deaths from gastritis etc. in 
Fall River occur when the mother works away from home and a rather higher percentage from 
congenital debility at birth when the mother does not work away from home will help us much 
in discovering the influence of the employment of the mother on infant mortality, nor do I think 
it will throw much light on the question of stillbirths with which the Report also deals. It is 
found that there are no more stillbirths proportional to all deaths when the mother is industrially 
employed, but it seems to me that this tells us nothing about the number of stillbirths pro- 
portional to all births. The real question is whether mothers employed away from home in 
factory or workshop, wuose other circumstances are the same, lose more children in the first year 
of life or have more children stillborn than the mothers who are only employed in their homes 
and I do not think a comparison of causes of death will lead us much further, and I think it may 
lead to difficulties. 


When dealing with the mother’s work after childbirth in relation to the causes of infant 
mortality it is pointed out that the smaller percentage of deaths from congenital disease among 
the children of mothers who returned to work after childbirth was owing to the fact that most 
of the children dying from this group of causes died in the early weeks of life before the mother 
returned to work. For this same reason the number of deaths from gastritis ete. of children 
whose mothers returned to work is exaggerated, for we are missing out a whole series of illnesses 


196 Miscellanea 


which have ceased to add to the child deathrate by the time the mother returns to work and we 
must increase in this way the percentage of deaths of any disease of the later months of a child’s 
first year of life. 


It seems to me that a comparison of deaths in this way will really give very little information ; 
an excess of deaths from one disease means a defect in some other disease; it is shown that 
when the baby is nursed exclusively by the mother 26-0 per cent. of the deaths were from 
diarrhoea, gastritis, etc.; when partly nursed the percentage was 52°3 and when artificial food 
was exclusively employed the percentage of deaths from diarrhoea etc. was 42°9; the baby 
certainly dies less from gastritis when it is breast fed but it dies in greater numbers from other 
causes. Here again there is a difficulty; deaths from congenital diseases fall on the first weeks 
of life when breast feeding is the rule, while deaths from gastritis etc. fall on the later months of 
child life when “partial breast feeding” has become more common and I do not think it is 
possible to draw any conclusions from a comparison of deaths from one disease to deaths from 
all diseases as to the importance of artificial feeding in relation to deaths from gastritis. 


Interesting information is given as to the reasons for artificial feeding ; the numbers are not 
large enough to justify any definite conclusions, but thisis such an important part of any inquiry 
into the influence of artifical feeding on the infant deathrate that one welcomes its inclusion in a 
report of this kind. 


WE have been requested by Professor F, M. Urban to insert the accompanying announcement. 


ANNOUNCEMENT. 


A prize of One Hundred Dollars ($100.00) is offered for the best paper on the Availability of 
Pearson’s Formulae for Psychophysics. 


The rules for the solution of this problem have been formulated in general terms by William 
Brown. It is now required (1) to make their formulation specific, and (2) to show how they 
work out in actual practice. This means that the writer must show the steps to be taken, 
in the treatment of a complete set of data (Vollreihe), for the attainment in every case of a 
definite result. The calculations should be arranged with a view to practical application, i.e. so 
that the amount of computation is reduced to a minimum. If the labour of computation can be 
reduced by new tables, this fact should be pointed out. 


The paper must contain samples of numerical calculation, but it is not necessary that the 
writer have experimental data of his own. In default of new data, those of F. M. Urban’s 
experiments on lifted weights (all seven observers) or those of H. Keller’s acoumetrical experi- 
ments (all results of one observer in both time-orders) are to be used. 


Papers in competition for this Prize will be received, not later than December 31st, 1914, by 
Professor E. B. Titchener, Cornell Heights, Ithaca, N.Y., U.S.A. Such papers are to be marked 
only with a motto, and are to be accompanied by a sealed envelope, marked with the same motto, 
and containing the name and address of the writer. The Prize will be awarded by a committee 
consisting of Professors William Brown, E. B. Titchener and F. M. Urban. 


The committee will make known the name of the successful competitor on July 1, 1915. 
The unsuccessful papers, with the corresponding envelopes, will be destroyed (unless called for 
by their authors) six months after the publication of the award. 


Corrigendum. Dr Derry has most kindly pointed out a slip on p. 307, Vol. VIII; the value 
of 100 (B—#H)/Z for Congo female crania is +1:9 and not —1°9, which brings these crania nearer 
to their proper place, and the remarks on this point p. 308 should accordingly be cancelled. 


outs 


if Ana omy and Physiology 


Be ; Caren THOMSON, University of Oxford 
University of Cambridge x ARTHUR KEITH, Royal College a Surgeons. 
ARTHUR ROBINSON, University of oe 


oe VOL. XLVIII- 


: oa ANNUAL, SUBSCRIPTION 21/- POST FREE 


_ CONTENTS OF PART Ill. —APRIL 1914 


a Note on Two Cases of - ‘Well-marked ‘Suprasternal Bovis: Professor PETER 
P e Development of the Lobus Quadratus of the Liver, with Special Reference to an 
a jatar of this Lobe in the Adult. ‘Professor F, G. Parsons. The Characters of the English 
gh-Bone, FReprric Woop Jonss, D.Se. The Lower Ends of the Wolffian Ducts in a Female Pig 
bryo. D.Davinson Brack, B.A., M.B. (Tor.). Two Cases of Cardiac Malformation—more especially — 
7 ofithe Infundibular Region. ‘Rarer Tuompson, Ch.M., F.R.C.S. Figures relative to Congenital Abnor- 
malities of the Upper Urinary Tract, and some Points in the Surgical Anatomy of the Kidneys, Ureter, 
and Bladder. Haron Riscusters, "MLA., M.D., B.C. (Cantab.), F.B.C.S. (England). Anomaly of the 
aR Inferior Vena Cava: Duplication of the Post-Renal Segment. Brrnarp Cozn, M.B. A Communication — 
as to the ‘Causation of large Vascular Grooves found on the Inner Aspect of the Os Parietale. Rupert 
Downes, M.D., M.S. (Melb.). The Interrelationship of some Trunk Measurements and their 
Relation to. Stature. ed J. T. Wisox, M.B., F.R.S. Observations upon Young Human 
bryos. . Review: Biedl. Innere Sas eS - physiologische ae und ihre vieaianed 
die eeiclane. \ : i a 


4 4 - 


“Yoh ‘XLII. _Taly—December, 1913 


 Sommas, W. J., M. A, Sc. D., LL,D., F.R.S. Paviland One an Aurignacian Station in Wales 4 (rhe 
Hucley Memorial Lecture for. 1913). (With Plates XXI—XXIV.) Jounsron, Sir H. H., G.C.M.G., 
K.C.B., D.Sc. A. ‘Survey of the Ethnography of Africa; and the Former Racial and Tribal Migrations 
| thai Continent. ‘Eyans, Ivor H.N. Folk Stories of the Tempassuk and Tuaran Districts, British 
forth Borneo. ‘Donnas, Hon. (CHARLES. History of Kitui. Parsons, F.G. On some Bronze Age and 
ne utish Bones from Broadstairs, with Type Contours of all the Bronze Age Skulls in the Royal College 


Egyptian Sudan. (With Plates. XXV_XXXVIIL) Hitron-Stueson, M, W., F.R.G.S. Some Arab 
and Shawia Remedies and Notes on the Trepanning of the Skull in Algeria. (With Plate XXXIX.) 
of Felloys. ‘SANDERSON, REP URE Ge enya Games of Central Africa. mee Title Page and List 
of Fe oye. ae ee ‘ 


ee WITH NINETEEN ‘PLATES AND. MANY ILLUSTRATIONS IN THE TEXT. 
we arma se) Se PRIOR: Ios, NET 3): 


y Lonoon THE ROYAL ANTHROPOLOGICAL. INSTITUTE, 50, Groat Russell Street, We, 
f or enyaeah any Bookseller ioe 


ah 


i . “ Publishea coiled the dacoion ef the Royal pee aed Tastinitel of Great Britain and Ireland. 
Bach number of MAN consists of at least 16 Imp. 8vo. pages, with illustrations in the text 
together ‘with one full-page plate; and includes Original Articles, Notes, and Correspondence; Reviews — 
and Summaries; Reports of Meetings ; and Descriptive Notices of the eaters of Museums and 
4 - Private Collections. : ; 
we, Mont ly or 108. une teas pena 


TO BE OBTAINED FROM THE 


inet 


of Surgeons’ Museum. “Seuiemann, C.G.,M.D. Some Aspects of the Hamitic Problem in the Anglo- 


University College, London. It is very desirable that a copy of all measureme 


- in Roman not German | characters. 


10s. net. Volumes I, Ul, UI, IV, V, VI, VU, VIIL and IX (1902. 


M.D. Dona “wie Plates I_VI ae 19 
Le) ‘Tables of Poisson’s Exponential piiomial 


ATES: Ae Poisson’s Dee of Small Numbers, By 
- Plate VII). alae PaaS 


IV. The Hainisnship Boreas Weight of. the Seed 
cof the Plant ee i} aes J. ARTH 
Diagrams in oe ae RE AN 


“15 admis in the text) 


‘VI. On Homotyposis and ‘Allied Charabkate's in 1 eee 
| Win1aM Rowan, K. M. ee B. Se. and . 

"Plates Natt set colours) and eo 

(Miscellanea: 


(i) The Statistical 1 Sindy. of Dietiries, a + Reply to, ‘Professor Kart uP 
By D. Norn Paton, F.R.S. f é 


'Gi) The Statistical Study of ‘Dietaries, a 3y I 
(ii), Note on the essential Conditions that a ‘Population bre Ee 


‘ (iv) The Elimination of Spurious | Correlation due to ‘Postion 
By “Student” i” Saat a 
(v) | On ‘certain Errors at regan ho. Meltiple Comelation o ieee 


(vi) 

(vii) | (With Pla 

(viii) Note on™ “Tafantile salt and Bape of Women. 

; ELDERTON . ariel Sone aaa 
| Announcement by Professor F. Ae ‘Unpaw oe j on 


Coregendae PMR obras fds sety ERs che is 


The UhBapiione ‘of a mee in “Bioheotes nucle that i in Hine E 
method or material something of interest to biometricians. But the Ed 
understood that such publication does not es assent to the argu 
drawn in the paper. — 

Biometrika appears about four times a year. A volume co 
tables, is issued annually. ‘ 

Papers for publication and books and offprints for notice should: be sent to 


for publication, should accompany each manuscript. In all ¢ the papers t! 

not only the calculated constants, but the distributions from which th xy hi ve be 
-and drawings should be sent in a state suitable for direct photograp fic repr 
paper it should be blue ruled, and the lettering only pencilled ye 


Papers will be accepted in German, French or Ttaliar 


Contributors receive 25. copies of ae ‘papers ay i dditional copies may be 
payment of 7/- per sheet of eight pages, or part of a sheet t extr ; 
Plates; these should be ordered when the final proof is return d. 

‘The subscription price, payable in advance, is 30s. net per volum 


Bound in Buckram 34/6 net per volume. ‘Index to Volumes It 
‘to C. F. Clay, Cambridge University Press, Fetter Lane, Lon 
_pookgeller, and communications respecting advertisements should als id 

Till further notice, new subseribers to Biometrika may obtain Tols 

bound in Buckrvam for £12 net. A 

The Cambridge University Press has aerated the Univers thi 
of Biometrika in the United States of America, and has authorised them toc 
$7.50 net per volume ; 3) single parts $2.50 net each. 


NOW READY 
FOR STATISTICIANS - 


TABLES © 


. CAMBRIDGE: Tae BY JOHN CLAY, 


ae 


Vol. X.- Parts IT and III. November, 1914 


BIOMETRIKA 


i 


A JOURNAL FOR THE STATISTICAL STUDY OF 
! BIOLOGICAL PROBLEMS 


FOUNDED BY 
W. F. R. WELDON, FRANCIS GALTON anp KARL PEARSON 


EDITED BY 
KARL PEARSON 


CAMBRIDGE UNIVERSITY PRESS 
C. F. CLAY, Manager 
LONDON: FETTER LANE, £.C. 
EDINBURGH: 100, PRINCES STREET 
also 
H, K. LEWIS, 136) GOWER STREET, LONDON, W.C. 
WILLIAM WESLEY AND SON, 28, ESSEX SEREET, LONDON, W.C. 
CHICAGO: UNIVERSITY OF CHICAGO PRESS 
BOMBAY AND CALCUTTA: MACMILLAN AND CO., LIMITED 
TORONTO: J. M. DENT AND SONS, LIMITED 
TOKYO: THE MARUZEN-KABUSHIKI-KAISHA 


Price Twenty Shillings net. 


[Issued December 3, 1914] 


I. "Mathematical GontHibations to che | 


‘Theory of Evolution.— XIII. On the Theory 


of Contingency and its Relation to Associa- 
tion and, Normal Correlation. By Kary- 


Parson, F.R.S. Price 4s. ‘net. 
re Mathematical Contributions to. the 


Theory of Evolution. XIV. 'On the Theory |~ 


-of Skew Correlation and Non-linear Regres- 
3 sy By Karn PEARSON, F.R.S. Price 5s. 


II. Mathematical Contributions to ‘the 


Theory of Evolution.—XV. On the Mathe- 


- matical Theory of Random Migration. By 
Karn Pearson, F.R.S., with the assistance 
of JOHN BLAKEMAN, M. Se. - Price 5s. net. 


he IV. Mathematical Contributions to the 


Theory of Eyolution—XVI. On Further | 


Methods of Measuring Correlation. By 


Karu PEARSON, F.R.S. Price 4s. net. 


_-V. Mathematical Coo oNe to the 


I. On the Relation of Fertility in Man 
to Social Status, and on the changes in this | 
Relation that have taken place in the last 

50 years. By Davip Heron, M.A,, D.Se. 

= Price 3s. Sold only with complete sets, * 
Il. A First Study of the Statistics of 
Pulmonary Tuberculosis (Inheritance). By 
Karu Pearson, F.R.S. Price 3s. net. 

IlI. A Second Study of the Statistics of 


Pulmonary Tuberculosis. Marital Infec- | . 


tion. By Ernest G. Pors, revised by Karu 


Parson, F.R.S. With an Appendix on | 


Assortative et by. ErsEn M. ELpERTON. 
Price 3s. net. 


IV. The Health of the Schoo!-Child in re- | 


lation to its Mental Characters. By Karn 
; ~ Pearson, F.R.S. Shortly. «| 
V. On the Inheritance of the Diathesis — 
of Phthisis and Insanity. A Statistical _ 
Study based upon the Family History: of | 
1,500 Criminals. By Caries Gorine, 
M.D., BSc. Price 3s. net. 


Questions of the Day ae of the Pray. 


Pte The fe of Parental Alcoholism 


» on the Physique and Ability of the Of- | 


spring. A Reply to the Cambridge Eeono- 


; ee mists. By Karn Prarson, . B.S. Price 


TS. et Sep s 
_ IL. Mental Defect, Mal-Nutrition, uA 
the Teacher’s Appreciation of Intelligence. 
- A Reply to Criticisms of the Memoir on 
_ ©The Influence of Defective Physique and 
Unfavourable Home Environment on the 
Intelligence of School Children. {By Davip 
Huron, D.Sc. | Price 1s. net. 


‘an An Attempt to correct some of the |’ 


- Misstatements made by Sir Victor Hors- 
LEY, F.R,S., F.R.C.S., and Mary D. SrurGE, 


M. D. is in their Criticisms of the Memoir: | 


. A First: Study of the Influence of Parental’ 
_ Alcoholism,’ &c. ‘By Kart ee ge FR. S. 
Price \s. net. 


Tx, Mendelism and the Problem of ‘Mental Defect. LER On the Graduated Character of 2 
Mental Defect, and on the need for standardizing Judgments as to the Grade of 
which shall involve Heese By ite { RRASEES F.R. a bl umber.) ~ 


| Studies i mM National Deterioration, a ts 


Le a 


VE. Hugenics and Public Health, e 


na ‘Mathematical ‘Contrib tions 
Theory of Evolution.—XVIIL. (€ 
- Method of Regarding th 
_ two Variates classed solely i 
- Categories. By Karn Al 
Price 4s. net. 1a ee 
‘VIL Albinism in Man. By Ka yi 
|. NerriusHyr, and OC. H. UsuE 
Part II, and Atlas, Part II. Pri 
Te -Albinism in Man. By Kari Px 
eae hed Dry NEDTLESHIP, and ,C. H. UsHt 
- Part ya ,and Atlas, Part IV. Price 21 


ne beget peek 


ree ¢ 


arene) "Pubstodlowe. 

~ of the Tuberculous and Fauna 
ment. By W. P. Expsrron, | 
5. ae ae Price 33, Hs 


- Pulmonary Tuberculosis : the M 
_ the Tuberculous : Sanatorium a \ 

- culin Treatment. By W. Party EEDERTON, 

ie FILA, and Smyey Die PARRY e giv. 

_ Price 3s. net. LS i 


3 i A Statistical Study of Oral. ‘Tem 
peratures in School Child: ith spec 
veference to Parental,. Environmental and. 
Class Differences. By M. H. Wi 
M.B., JULIA Brut SM AC 
"PEARSON, ERS uae “6s. net. 


Iv. The Fight eee Tuberculosi and 
the Death-rate from Phthisis. Hh 
Sos eiby E.R. S. pes As. ne 


Past, » Present an “Putre,’ 
--PEAnson,: FERS. Price Is.. net. 


+ 


to the York Congress of the Royal S: 
Institute. By eee ieee FE. 


a Domai. ina The bebe of 
- Defect. ny Karu Pzarson, E.R. 
Gustav. A. JampERHOLM. Price 1s, net. 


VOLUME X NOVEMBER, 1914 Nos. 2 AND 8 


A PIEBALD FAMILY. 
By E. A. COCKAYNE, M.D., MRCP. 


In spite of the great interest, which they have always excited, well 
authenticated examples of piebalds in the dark races have been found to be rare. 
In the white races they are much less conspicuous, in part owing to the presence 
of clothing and in part owing to the lack of contrast between the pigmented 
and unpigmented skin, but the likelihood of their coming under the notice of 
a skilled observer is much greater. The scarcity of records shows that piebalds 
in the white races also must be very uncommon. Last year I met with a case 
in a baby, and found that the child belonged to a family, many of whose members 
showed a similar defect of pigmentation. The family, belonging to a farming 
stock, originally came from the neighbourhood of Bury St Edmunds in Suffolk, and 
the anomaly is known to have descended directly through six generations. The 
oldest member, with whom I have talked, is fairly certain that it was present in 
one generation at least before this. 

Of the first two generations in the pedigree (see Plate XI), I could obtain no 
definite information except the statement as to the existence of the piebaldism in 
I. 1 and II. 1, but of the third, III. 2 is said to have had a frontal blaze of white 
hair and white skin on the neck and forearms, which was very conspicuous owing 
to its marked contrast with the neighbouring weather-stained normal skin. 

IIL. 4 appears to have been the only member of the family who showed a 
marked dislike to the condition and always wore a wig to hide the frontal blaze. 
III, 2, whose family name was C—-—%*, had fifteen children. The first, IV. 2, 
a male, with dark hair, married twice, and had eight normal children, five by the 
first wife and three by the second. The second child, IV. 5, was a piebald, with a 
large frontal blaze, white skin on the front of the neck and arms, and blue eyes. 
He transmitted the condition to all his three children. V. 3, the eldest boy, 
aged 22 and unmarried, possesses dark hair, with a V-shaped frontal blaze of 
white or cream coloured hair, the apex of the V commencing near the coronal 
suture and spreading out to a width of 34 inches, as it reaches the forehead. The 
eyebrows and some of the eyelashes are white. The next boy, V. 4, is aged 18. 
He has light hair and a very large blaze of unpigmented hair, which covers the 
whole of the top of the head. His eyebrows and eyelashes are white, and the eyes 
are blue. Both boys have white patches on the front of the neck and on the arms 


(see Plate XII). 


* Names preserved in the confidential register of the Galton Laboratory, 


Biometrika x 26 


198 A Piebald Family 


Next in the fourth generation were twins, IV. 6 and 7, both piebalds. They 
were evidently not uniovular, because one had dark hair, and one light, and the 
white blazes were dissimilar in extent, but it is uncertain which had the larger. 
Both died at an early age. 

The next, IV. 8, a girl, was normal with dark hair and eyes and remained 
unmarried. Next came a woman, IV. 10, who was a piebald with a large frontal 
blaze, white eyebrows and eyelashes, and white skin on the front of the neck and 
forearms. The right eye was blue, and the left brown (see Plate XIII (B)). Her 
child, aged 13, is quite normal with light hair and dark eyes. 


The next child, IV. 12, Mrs W——, has a large frontal blaze and dark brown 
irides. There is a large irregular patch of white skin extending from just below 
the chin to the heads of the clavicles, and round it the skin appears to be more 
deeply pigmented than the rest of the skin of the neck. There are a few small 
islands of pigmented skin near the edge of the unpigmented area. The skin of 
the anterior aspect of the forearms is unpigmented from the elbows to the wrists, 
and here also, there are some small islands of pigmented skin in marked contrast 
to the unpigmented area, in which they lie (see Plate XIV). 

The first two children of this individual were daughters, V. 8 and V. 9, both 
piebald, the third a normal son, V. 10, and then three more piebald daughters, 
V. 11—138. The first of the daughters, Mrs G——, V. 8 (see Plate XIV), is very 
fair with a very large frontal blaze covering the whole of the top of the head, 
and her eyebrows and eyelashes are white. Her normal hair has pale creamy 
diffused pigment and, according to the individual hair, some to a decided number 
of granules*. The hair of the blaze has no diffused pigment and no granules. 
The irides are light brown, but the outer segments on both sides are paler and 
greenish in colour. The skin of the forehead and base of the nose is very pale in 
colour. She has a large white patch on the skin of the front of the neck, beginning 
just below the chin and widening out so as to embrace that over the inner ends 
of both clavicles. As in her mother there appears to be some concentration of 
pigment round this white area, and there are small isolated areas of pigmented skin 
near its edge. She has unpigmented skin on the anterior aspect of both forearms. 


Of her two children the first, VI. 1, a boy aged 8, is normal, the second, VI. 2, 
a boy aged 14, is a piebald (see Plate XIV). This child, VI. 2, was nine months 
old when first seen. He had a very large frontal blaze, resembling that of his 
mother and covering all the top of the head, the eyebrows and eyelashes were 
white with the exception of some of the outer hairs. Hair, pale cream in colour, 
said to be from the light area, has pale creamy diffused pigment and some granules 
(8), the granules being very small. It was obvious, even at this age, that hetero- 
chromia iridis was present. The right iris was pale except for a sector of dark 
grey occupying the upper and outer quadrant, the left iris was entirely dark grey. 
No difference in colour of the skin of the neck or forearm could be made out. 


* B to y on the Galton Laboratory scale of granular pigmentation. 


KE. A. CockayNneE 199 


When the baby was seen after the summer of 1913, the grey portions of the irides 
were becoming brown, the pale portion was still light blue. The face and arms were 
sunburnt, and it was noticed that the forehead was paler than the rest of the face. 
There was a pale area on the front of the neck, and the whole anterior surfaces of 
the forearms were white, the edges being very irregular in contour. There was 
also a white streak running obliquely right across the posterior or extensor aspect 
of the left forearm, and this offered a marked contrast with the rest of the surface, 
which was very brown. When the sunburn had died away the difference between 
the pigmented and unpigmented skin could no longer be made out. 

IV. 12’s second daughter, V. 9, aged 23, has only a small cream coloured 
frontal blaze, and the rest of her hair is light brown (see Plates XV and XVI). 
The eyebrows are composed of an even mixture of brown and white hairs, and 
the eyelashes are similar, with brown and white hairs alternating. The irides are 
grey and uniformly pigmented. There is a large irregular area of white skin at 
the base of the neck. 

The whole of the anterior aspect of the right forearm is unpigmented, and 
there are similar small areas scattered over the posterior aspect (see Plate X VII). 
The left forearm is white only on the anterior aspect. 

The next girl, V. 11, is aged 9. She has a very small frontal blaze, but the 
skin of the forehead is pale (see Plate XV). The eyebrows show a division into 
two parts, on the inner halves grow white hairs only, and on the outer brown hairs. 
The eyelashes on the contrary consist of alternate brown and white hairs. The 
irides are grey and uniformly coloured (see Plate XVIII). There is only a small 
white area in the middle of the front of the neck, but there are well differentiated 
white areas on the anterior aspects of both forearms (see Plate XIII (A)), and on 
the inner aspects of both upper arms. Her hair was examined and the first sample 
showed very pale diffused pigment and some granules (8). Two more samples 
were then examined, one from the blaze and one from the neighbouring part of 
the scalp. The first showed no diffused pigment and no granules, the second 
showed the majority of hairs with yellow-brown diffused pigment and a decided 
number to plenty of small granules (y—6), but a few had no diffused pigment 
and no granules. 

The next piebald child, V. 12, died young. She had a frontal blaze and blue 
irides. Some of her hair showed very pale diffused pigment, and some granules (8). 

The next child, V. 13, also died young. She was a piebald nearer to the 
classical type than any of the others. She had a large frontal blaze, white skin 
on the forehead, and large areas of white skin on the front of the neck and 
chest, and in addition a very extensive area on the abdomen. 

Of the fourth generation the next child, IV. 13, was a male with dark hair and 
eyes, who had 5 normal children; the next, IV. 15, had fair hair and died young. 
Twins, IV. 16 and 17, came next and died in infancy*. They were heterogeneous, 


* The tendency to twin in this family is worth noting. 


26—2 


200 A Piebald Family 


a dark-haired boy and a light-haired girl. A girl, IV. 19, was born next and she 
had twin sons, V. 15 and 16, who were also normal. The last three children, 
IV. 20—22, a girl, a boy, and one whose sex I am unable to ascertain, were all 
normally pigmented and all died at a very early age. 


The pedigree confirms the strongly hereditary nature of piebaldism, and in this 
as in other published cases the character can affect either sex, but has only been 
transmitted by those affected. Unless we are to assume that in the case of such 
a rare anomaly as piebaldism, I. 2, II. 2 or III. 1, were really unnoticed piebalds, 
then III. 2 could only be heterozygous, or since piebaldism is dominant a (DR). 
We must take IV. 4, IV. 9, IV. 11 and V.7 for pure recessives (RR). Thus the 
number of piebalds in the five sibships of generations IV, V and VI should be one 
quarter, Le. $(15+34+1+6+2)=7 nearly. We have actually 14 out of 27, 
thus piebaldism does not seem to act numerically as a pure dominant. 

The areas of unpigmented skin are less than in the classical piebalds, but it is 
probable that in some, at least, they are larger and more numerous than I have 
stated. On the covered parts of the body and legs, which I was unable to 
examine except in the baby, they would not be very noticeable. It was not until 
I had noticed the white skin on the neck and arms of one of them that I was told 
anything about the existence of similar patches on the others. If true, it is 
remarkable that none have had white patches on the legs. 


With regard to the local distribution of the pigment, there appears to be an 
excess at the edges of some of the unpigmented areas, as has been noted in other 
cases. In the case of other pale areas, the demarcation between them and the 
normal skin is very slight, and is probably due to the fact that they are not wholly 
unpigmented. ‘This remark applies especially to the forehead, which in some of 
them looks paler than natural, but not wholly devoid of pigment. 


In some the eyelashes are alternately white and brown, and in others the eye- 
brows are similar, and in one at least hairs growing on the scalp near the blaze 
are in some instances entirely without either diffused or granular pigment. This 
suggests that the skin beneath may show a deficient and irregular distribution of 
pigment. 

The most interesting feature is the occurrence in three members of the family 
of well-marked heterochromia iridis, a character which has been met with in 
members of a piebald family, but always independently of their piebaldism, never, 
as in this case, in true association with it. It proves conclusively that these cases 
are not congenital leucoderma. 

There seems to be no association of piebaldism and general lack of pigmenta- 
tion of hair and irides. Affected and unaffected members have been both fair and 
dark, but the fairest piebalds seem to have the most extensive frontal blazes. 

In the cases photographed the individuals were blonds and there has been great 
difficulty in getting a good photographic contrast of differences of pigmentation 
very noticeable in the living subject. 


Plate X| 


Biometrika, Vol. X, Parts Il and Ill 


“ATLULR pleqeig Jo valsipog 


*"|RULION O “pleqatg & "SIPLIL BIMMOTYDOIeJaAY 2& ‘punod pelq + 


oor € : 


+ 
Cle 1G 1 Oo © © 2 2:01.06 Se 2 eo 
hc Oat 2 yp + 4 : + + 
6-6-6 ©6010 16 © 8 0 210 0: 62 © © @ 22) 


> 


‘III 


Biometrika, Vol. X, Parts Il and III Plate XIl 


V. 3 and V. 4 as children showing their marked V-shaped frontal blazes. 


Plate XIII 


Biometrika, Vol. X, Parts Il and III 


a 


‘aTqeysmnsurysip 


‘sudyT & 


Japun Aypno 


T 


ATIsva 


WW 


Tp Ut 


Ss 


M Uaes aq 


qa 


I SIPLIT BVIULOITOOLI}JoY otf} qnq 


Avul ozetq eu ‘OL ‘AL 


aq} uo va 


1B 


“WIR Jol ayy Jo yoodse ror 


ayOSNIT payerywa 


Taytp [Joa wv 


ajue 
SUIN 


ALOUS 


I 


iE 


A 


Plate XIV 


Biometrika, Vol. X, Parts II and Ill 


$ 


: 


“sUaT B JO BSN ay} YIM Z| ‘TA JO ada FYSLT oy} JO TWorjoos AoyABp a} PUY daIty [[B UL OTQISTA 


UB [BYUOIF VIA OTT, ‘mOspuRID . LGN: paw ‘TaTJOMIpPUBI vail ANT: ‘TaYyOoul "8 iN ‘suOTyRIoUIS vot} 


UL 


wsIp 


[|e 


qat 


tq 


Biometrika, Vol. X, Parts Il and Ill 


Plate XV 


‘wo sisters with white frontal blazes. 


dances 


V. 


Biometrika, Vol. X, Parts Il and Ill Plate XVI 


V. 9. Showing white forelock or blaze, 


Biometrika, Vol. X, Parts Il and Ill Plate XVII 


Right forearm of V. 9 showing white patches on posterior aspect. The photograph is untouched and it is difficult 
to bring out by photography the grades of pigmentation when the arm is untanned by the sun, although they 
are quite clear on actual inspection. 


Biometrika, Vol. X, Parts Il and Il Plate XVIII 


Large photograph of Y, 11 to show paleness of forehead and white hairs on inner half of eyebrows. 


CLYPEAL MARKINGS OF QUEENS, DRONES AND 
WORKERS OF VESPA VULGARIS. 


By OSWALD H. LATTER, M.A. 


Upon the front of the head of Vespa vulgaris certain yellow markings stand 
out conspicuously upon the otherwise black surface. Below the three ocelli 
and between the upper portions of the two compound eyes there is a median 
four-sided yellow patch, the “corona”; to the right and left of this, separated 
from it by a fairly wide interval, and occupying the bay of each of the compound 
eyes is a pair of elongated yellow blotches; while straight below the corona and 
between the lower portions of the compound eyes is a very conspicuous yellow 
area which “extends over the clypeus and down to the labrum or upper lip which 
hes between the two mandibles. This clypeal patch of yellow bears upon it a 
black mark which is subject to considerable variation. I distinguish in the queens 
and workers five chief types of this black mark: see diagrams on p. 202. In 
Type I a broad vertical black band extends right through the yellow patch from 
the top to the bottom; a little below its middle the band bears to right and left 
a pair of bluntly pointed and slightly upturned arms: the portion of the median 
band below these arms is somewhat narrower than that above. Type II is derived 
from I by suppression of the black portion below the transverse arms. In Type III 
the extent of the black colouring is yet further reduced by the absence of the 
upper half (or thereabouts) of the vertical band. In Type IV the lower part of 
the vertical band re-appears, but the width of all the components is very much 
less than in any of the preceding types. In Type V the component parts of the 
black marking cease to be in contact; the upper portion of the vertical band 
is interrupted by a broad belt of yellow; the two “arms” are separated from the 
lower part of what remains and from one another; while there is no black at 
all below these remnants of the “arms”—a feature recalling Types II and III. 
Types IV and V are however represented only by single individuals in the series 
examined. 


Between these main types certain intermediates occur. Thus some individuals 
have the black piece below the “arms” very narrow, approximating therefore 


202 Clypeal Markings of Vespa vulgaris 


to II, but conforming to I if we take extension of the black right through the 
yellow as the criterion of I; such individuals are distinguished as 1+ II. Others 
again conform to II but possess a slightly darker stain on the yellow in the line 
where the distinctive lower black portion of I might occur; these are called I+ 1. 
Similarly, intermediates between II and III are recognisable: in II + III the top 
of the upper portion of the vertical black band is very narrow; while in III + II 
there is a mere stain on the yellow of this region. A single instance occurs of an 
intermediate between I and III (1+ III), where the vertical black band extends 
right through the yellow, but is much narrowed at its upper extremity. 


Front of head of V. vulgaris ¢. 


SD Ocelli 
Eye 
Yellow 
Patch tn Girsen 
bay of Eye 


(All parts left white are actually yellow.) 


Clypeus alone 


HF) (*) @) 


Types I Il III IV W 
4 VII) VII VIII (3 VII) VIII 


Drawn by K. W. Merrylies. 


My first examination consisted of about 200 tubes containing queens of 
Vespa vulgaris from different nests. In the case of some queens the heads 
were missing, and in the course of transit the contents of some of the tubes 
had got loose in the jar. I have numbered these 199 to 208. The results are 
given in Table I (p. 204) and the summary below: 


O. H. Larter 203 
Type I Pure Poh 
‘. I+II 11 62 
, 1+ Il] I 
Type II II+I] 29, 
5 Pure 94} 120 
55 II + Il 4 
Type III III + I a : 
4 Pure 3, C4 
III +1V 0| 
Type IV IV +I1ll 0 
a Pure | il 
2! IV+V 0 
Type V Vea Hi 1 
r Pure 1 
Total: 185 


It will be seen that transitional cases undoubtedly occur. The bulk of the 
queens, however, fall into Types I and II, or queens are very little variable. 


To test: (1) whether this variability was still further lessened by taking only 


the queens from a single nest, and (11) the relative variability of queens, drones 
and workers, I now examined all the queens, workers and drones of a single nest 


of V. vulgaris. 
In this case all the 127 queens were of Type II*. 


The classes of the workers are given in Table II (p. 205) and may be sum- 
marised as follows: 


Type I Pure 5) 10 
§ v.s or (I+II)? 5| 
Type II II+I 6 
“ Pure at Ae 
Total: 172 


It will be seen that they are somewhat more variable than the queens of the 
same nest, but not so variable as queens from different nests. 


T now turn to the drones of this same nest. JI had 150 at my disposal. 


The drones exhibit a very wide range of facial markings. In the material 
examined comparatively few fall into the scheme of classification adopted for 
the queens and workers, and it thus becomes necessary to resort to six types 
of face which appear to be peculiar to the male sex. These are numbered VI, 


* There were 129 queens in this nest, but No. 34 was missing and No. 98 had its head damaged too 
badly for classing. 


204 Clypeal Markings of Vespa vulgaris 


VIG VID, VII, VII (4 VIID, VIII and IX, see diagrams, p. 202. In Type VI 
there are two somewhat elongated black dots upon the yellow clypeus, one being 
sub-central, the other on the ventral margin; in VI (4 VII) the ventral dot is 
longer dorso-ventrally and a third dot appears upon the left side (right side, in 
figure seen from in front) opposite the gap between the two previous dots; 


TABLE I. 


Types of Clypeal Markings in V. vulgaris Queens. 


No. | No. | No No. No. | 
| | 
| | 
| oe = 44 III 87 | 1+II | 180 = 173 Ill 
hee II Dials 88 II 131 II 174, | Tees 
| 3 | Il4I 46 II 89 ae a 132 II 175 II 
| 4 1 47 = 90 alge 133 Il WG ho WIESE 
ae ig alle AS. Wise Oe L 134 — 177 Tore 
6 at AQ) 3) > Sel 92 et 135 I Tey) ~ UE 
re |) IE BORE in ale 93) LEE in| 36 II 179 esi 
3 ail 51 if 949 i 137 II 180 II+I 
9 | II o2 II 955) ILE 138 I 181 II 
10 || I 5a el 96.09) 139 II 182 = 
11 Il+I 54a eee 97 I 140 I 183 | I1+II1 
12 II+1 ii) 10 98 dee 141 I 184 = 
Lea 9 al 56 md 99 |II+IIL} 142 Teen 185 I 
14 oe 57 IL 100 I 143 I 186 Il 
15 II 58 pa 101 ats 144 Il 187 =|) St 
16 II 59 I 102 Il 145 II 188 | I 
7 A eaeale 60) meet 103 I 146 I 189 I+II 
18 Il 61 IL 104 IU 147 II 190 II 
19 | II+I 62 II 105 Il 148 =e 191 I 
20) ae 63 II 106 a 149 i 192 I 
HS ee Al 64 | II 107 I 150 Il 193 III 
22a mel (ij) = TU 108 Il 151 II 193.4 II 
937 | 21 66 II 109 I 152 = 194 = 
OAgea eles 67 = TOKO} 7 |) S50 153 II 195 | I+III 
25 IL+I Con ay tll 111 | IE+T 154 1H 196 II 
26 I 69 eet 112 Il sys | AU 19 jee 
27 Il 70 113 I 156 Il 198 IIl+I 
28 | I a Il 114 II 157 Til 
29 TST 72 ] 115 |II+I0T} 158 IIl+I = - 
30 I a eal 116 II 159 Il 
31 II Mee letsel Has IU 160 I Loose | 
32 II Aven AT 118 I 161 I 
33 V 75 | II ie) |). HE 162 Il 199 I 
34 Tal M6 | SUL 120 |II+III} 163 I 200 I 
Si alipageeee i, ae 191. || 7 ale 164 II 201 I 
36 iat AGH), eae 129° | 165 II HO i) ULE 
37 I oe ee 122 -- NGS | IM 2O3ke ee lal 
38 II 80 ee 124. I Gy | 7 204 Il 
39 Il+I] Se eee 125 II 16S 205 II 
40 ete 82 - 126 elie es 169 |) »— 206 Il 
Alton lee 83 I NO Wel 70a) eae 207 Ll 
AN Oyl ad 845 i 128 | II 171 I 208s 
| 42 Il 85 128b Il 172 = 209) | sri | 
| 43 I 86 | TSU 129 SHI | | 
| 


O. H. Larrer 205 


TABLE ILI. 
Types of Clypeal Marking of Workers of a single Nest of V. vulgaris. 
No. No. No. No. No. 
1 — 37 II 74 II 112 II 150 II - 
2 — 38 IT 75 II 113 II 151 II+I1 
3 — 39 II 76 II 114 II 152 II 
4 _- 40 II 77 II 115 II 153 IT 
5 — 41 II 78 — 116 II 154 IT 
6 — 42 II 79 I 117 II 155 II 
7 — 43 II 80 II 118 II 156 II | 
8 —- 44 II 81 II 119 II 157 Il | 
9 II 45 |Iv.v.s. 82 II 120 II 158 II 
10 II [1+11] 83 II 121 II 159 | II 
11 II 46.) I 84 II 122 II 160 II 
12 Iv. s. 47 II 85 it 123 II 161 II 
[I+11] ] 48 I 86.) 1d 124 I 162 | II 
13 II 49 —— 87 II 125 II 163 II 
14 II 50 IT 88 II 126 II 164 I+! 
15 I 51 II 89 II 127 II+1 165 II 
16 II 52 II 90 II 128 II 166 II 
17 II 53 II 91 II 129 Il 167 Il 
18 IT 54 II 92 II 130 II 168 IT 
19 II 55 II 93 II 131 II 169 II 
20 II 56 II 94 Il 132 II 170 II 
21 II 57 IT 95 IT 133 II 171 II 
22 IT 58 Iv. s. 96 II 134 II 172 II 
93. | Tf Aeim o7 fo = | 135 | mm} ia | 1 
24 II 59 II 98 II 136 II+I1 174 II 
25 I 60 II 99 II 137 II 175s I 
26 II 61 II 100 II 138 II 176¢" If 
27 II 62 lil 101 II 139 1H ied, II 
28 II 63 II 102 II 140 II 178 II 
29 II 64 II 103 IT 141 II 179 II 
30 Iv.s. 65 I 104 II 142 Il 180 Il 
f[+it)| 66 -| IL 105 | II ea iii [er 
31 II 67 II 106 II 144 II 182 II 
32 II+I 68 II 107 II 145 II 183 II 
33 II 69 I 108 II 146 II 184 II 
34 — 70 II 109 II+I 147 II 185 — 
35 IAs al II 110 II 148 II 186 — 
(I+1I] } 72 II 111 II 149 IT 187 II 
36 II 73 II 


in VII the two median dots are united by a slender black line, and there is a pair 
of lateral dots, right and left; in VII (4 VIII) the median line is of uniform 
width, extending from about the centre to the lower margin, and to its left side 
there is a single dot; in VIII the median line alone is visible, both lateral dots 
having disappeared; while in IX there is no continuous median line, but merely 
two black spots, one at the extreme dorsal and the other at the extreme ventral 
side of the clypeus. It will be noticed that Types VII—VIII approximate to 
Type IV in so far as the black stripe begins at about the middle of the clypeus 
and extends right down to the ventral margin. 


Biometrika x 27 


206 Clypeal Markings of Vespa vulgaris 


The data are given in Table III (p. 207) and are summarised below: 


very narrow 6 
ape a A VII) if f 
Type II no horns 1, very narrow 1 2 
Type III : : : : : 0 
Type IV : : : : : 0 
Type V ; : : ; 0 
Type VI Pure oi 60 

3 Vale vali 1 
Type VII 5 : : . nee, 

Type VIII (VIII near VI) 8 
F (VIII + $ VIT) 27 58 

, Pure 48 
Type IX : : : : ‘ 2 
Total: 151 


It will be realised at once how far more variable the drones, of even one nest, 
are than the workers or queens for this character. But their variability is rather 
of a negative than a positive character, appearing to consist in more or less extensive 
absence of the fuller markings of queen and worker. 


The results here deduced for variability of non-measured characters do not 
wholly agree with those found by Wright, Lee and Pearson on the wing measure- 
ments of the same nest of V. vulgaris. They found that for absolute measurements 
the variability as determined by the coefficient of variation was in every case such 
that the worker was more variable than the drone and the drone than the queen. 
On the other hand they found when they dealt with zndzces that the drone for 
wing measurements was slightly more variable than the worker and the queen 
less variable than either*. Possibly the divergence apparent here may be explicable 
in tbe sense of the drone’s variability lying in the present case in an absence 
of marking rather than in any positive variation. The drone’s variation is about 
a centre of much dimimished marking. If we could measure the variation in the 
total area of marking in queen and worker we might find it as great as the varia- 
tions in the smaller markings of the drone. 


It would be of much interest to investigate a series of drones from different 
nests. It is clear that the clypeal markings form a secondary sexual character 
and they would probably provide classifications for hereditary purposes. 


* Biometrika, Vol. v. pp. 414 and 421. 


O. H. Latrrer 


TABLE III. 


bo 


Types of Clypeal Markings in Drones of a single Nest of V. vulgaris. 


No. No. No. No. | No. 
| 
1 VIII 33 VII 63 VII 93 VI 123 VIII 
near VI} 34 VI 64 Wi 94 VI 124 I 
2 VI 35 VIII 65 VII 95 VI narrow | 
3 VIII 36 VI 66 VIII 96 VII 125 VI 
4 VIII 37 VI 67 I 97 il 126 VIII 
5 VI 38 VII very very 127 VII | 
6 VI 39 VI narrow | narrow | 128 VII | 
7 VIII 40 VII 68 VI 98 | VIII 129 VIII 
8 VIII 4] VI 69 VIII 99 VIII 130 VIII 
9 VIII 42 VI 70 VI 100 VI 131 VI | 
10 IX 43 VIII Hil II 101 | VI gy VIII | 
11 VI 44 VI but no 102 AVIA 133} VI 
12 VIII 45 VIII horns 103 VIII 134 VI 
13 VI 46 VIII 72 VIII near VI} 135 VI 
14 Vall 47 VI Wp} VI 104 VI 136 VIII 
15 VIII 48 VIII 74 VIII 105 VIII 137 VI 
near VI} 49 VII 75 VIII 106 VII 138 I 
16 VIII 50 VIII 76 VIII 107 VIII dots of 
17 Vi 51 VII 77 VI 108 VIII VII 
18 VI 52 VII 78 VI 109 VI 139 VII 
19 VIII 53 VIII 79 I 110 VI 140 VI 
20 VII 54 VI narrow Wadi WAL 141 VI 
21 VII 55 I 80 VI 12 VIII 142 VIII 
22 VIII very 81 VI 13 VI 143 VIII 
23 VIII narrow | 82 VIII 114 VIII near VL 
near VI} 56 IX near VI} 115 VI 144 VI | 
24 VII 57 I 83 VIII 116 VI 145 VIII | 
25 VII very 84 VII 117 II near VI 
26 VI narrow | 85 VI very 146 VIII 
De VI 58 VIII 86 VI narrow 147 VI 
28 VI near VI} 87 VIII 118 == 148 VIII 
4 VII 59 VIII 88 VII 119 VII 149 Vali 
29 VIII 60 VIII 89 VI 120s ee Vall 150 VI 
30 VIII 61 VIII 90 VIII 121 VI 151 VI 
31 VI 62 VIII 91 VII 122 VI 152 VI 
32 VIII iVII 92 VII 
4 VII 


27—2 


“ar 


TABLE OF THE GAUSSIAN “TAIL” FUNCTIONS; 
WHEN THE “TAIL” IS LARGER THAN THE BODY. 


By ALICE LEE, DSc. 


In a paper published in Biometrika, Vol. v1. pp. 59—68, tables for the’ in- 
complete normal moment functions were printed, and they have since been 
reproduced in Tables for Statisticians and Biometricians recently issued from the 
Cambridge University Press. From these tables values of the Gaussian “Tail” 
functions were deduced and a short table of yy, and yy, appeared in Biometrika, 
Vol. vi. p. 68. The value of these functions being demonstrated in practice 
during the last few years, a more complete table of yh, W., Wy; has appeared in the 
Tables for Statisticians and Biometricians. 


In the introduction to those tables, however, Professor Pearson indicated that 
it was important to have a similar table when the “tail” forms more than half the 
entire curve, and gave the fundamental formulae for obtaining the numerical 
values of the functions. The present table has been calculated to supply the 
want thus indicated. 


10 


ff 
y 


Li 


E B oO H Cc 


Let the figure represent a Gaussian curve of total population WV and standard 
deviation o. Let AB be the ordinate at which it is truncated and let 
OB = haxaor 
Let GH be the ordinate through the mean G of the truncated portion and BH =d, 
the distance of the mean from the line of truncation, let } be the standard 
deviation of the truncated portion about GH, and n=the area of the truncated 
portion, or of the population observed. Then if any material be supposed to 


ALICE LEE 209 


form a truncated portion of a normal curve, d, n and = can be found (see Tables, 
pp. xxvii and 25). 


We have Apt ORY Sccaids oe str gh ssa oj Sow ee din «ies son (i), 
Arn 10) pte sc rmas enee nice Se ncnen tageaaGs Antes (11), 
Aira OV Ute cee cranes Gacis cuties aihsielstas vas b,c (iil) 


These are tabled for each value of h’, at first proceeding by (01 and then by ‘10 as 
unit. Now y, being known we find h’ from the table, and hence deduce yy, and 
3. 2 gives us the value of o from known d. Hence h=h’ xo can be found, 
lastly (111) gives us the total population from which n is drawn. Thus the constants 
NV, o and h which fix the total Gaussian are determined. 


It will be sufficient to illustrate the method of using the tables on certain data 
as to the English thigh-bone, recently published by Parsons*. 


Dwightt+ has adopted a method of sexing human femora on the basis of a 
markedly bimodal distribution obtained by him for American bones. He terms 
female any femur with diameter of head less than 45 mm., and male any femur 
with diameter of head over 47 mm. Parsons follows this rule and sexes by other 
points femora with heads from 45 to 47. As unsettled remainder he has 20 femora 
of 45 mm. and he gives 12 to ? and 8 to #3; of 46 mm. and 47 mm. he has 41 
femora and he gives 4 to ? and 37 to f. As a result of this process he obtains 
a female frequency curve which rises very abruptly at high values of the diameter, 
and a male frequency curve which rises very abruptly for low values of the femora. 
But, if there really be any marked skewness in frequency of the parts of the 
human skeleton, which is very unusual, we should anticipate that it would be of 
the same sense. Parsons’ distributions are as follows (loc. cit. p. 256): 


50 | 51 | 52 


| 
36 | 87 | 38 | 39 | 40 | 41 | 42| 48 | 44.| 45 | 46 | 47 | 48 | 49 


1 1/—| 3 |] 8 |] 14; 12) 18 | 12] 12} 3 
8 | 8 


a 1 can 
99) | 1741 Si || 19 i 10 


bo 


The ¢ 48 mm. femur according to.the rule should have been treated as a 
male but presumably it had marked female characters. Were there no marked 
male characters in any bone below 45 mm.? It will be seen that there is a 
remarkable dip in the total material at 46 mm. which corresponds to Dwight’s 
division. In material measured six years ago in the Biometric Laboratory, where 
every bone in a relatively large series was measured, no such dip occurs and there 
is in those data no justification for Dwight’s method of sexing. The group of 
29 § bones at 47 mm. and the sudden cut off at 45 mm. seems to condemn this 
method of sexing, at “ay rate from the statistical standpoint. 

* Journal of Anatomy and Physiology, Vol. xiv. pp. 238—267. 


t+ American Journal of Anatomy, Vol. tv. p. 19. 
+ This material has been statistically reduced and will shortly be published. 


210 Table of the Gaussian “ Tail” Functions 


Without arguing this point out here, we may illustrate the use of the Table 
(p. 214) of w’s by taking two of Parsons’ frequency distributions for females; we 
will cut them off at the points suggested, and then investigate the total popu- 
lations of females which result. Our author pools for these distributions right 
and left bones. 


Taking the diameter of “head of femur” for the females, we have 


| | 
| Diameter in mm. ... | 36 | 37'| 38 | 39 | 40 | 41 | 42) 43) gL 
| | 


Frequency ... sohe |Past 1/—]} 3] 8 | 14 12 | 18 | 12 


These are exactly the bones the Dwight process gives as female. We find 


>? = 28851, 
d =2°6159 (measured from 44°5). 
Hence wy = S2/d? = 4216. 


Whence by interpolation from the table 
h’ ='782, W,='864, W;= 1-278, 


leading to o = 2:260, he edGn, 
and Mean = 42°73 mm., NV =88-2. 
Parsons gives for R. femur, Mean = 43, 


L. femur, Mean = 42, 


and the total number of bones dealt with 55 + 48 = 1083 (Tables, loc. cit. pp. 249— 
251). In his frequency distribution (p. 256) he only records 85 female bones, 
which give a mean of 42°54 and a standard deviation of 2078 mm. These values 
are clearly not widely divergent from those we have found above by supposing 
all bones under 45 to be female. 


To test the matter further the 105* female bones of which the head was 
measured by Parsons were taken out. They provide the distribution : 


how | 

| Diameter in mm. ... | 36 | 37 | 388 | 39 | 40 | 41 | 42 | 43 | 44 | 45 | 46 | 47 | 48 | 

Sil 

Frequency... ... | 1 | 1 | 4 | 109)18' | eulPotalarsiie4 3) == leer 
These give Mean = 42°47 . 
3 Dis 1-996; = 105, 


* It is not possible to say whether he has omitted two queried measurements. He has not omitted 
bones he queries in breadth of lower articulation. 


| 
| 


| Frequency 


ALICE LEE 211 


Cutting off all bones over 45°5 we find 
>? = 26392, d=2-6264, 
leading to vr, = 3826. 
Hence bh’ ='984, WW. ="783 and y,= 1195. 
These provide for the non-truncated population, 
Mean 42°48, s.D.=2:056, N=104, 


which are in still better agreement with Parsons’ constants for the 105 bones than 
the constants for the 85 bones were for their series. It would appear therefore that, 
if we suppose all bones under 45 female and use our Tables, we get results in 
reasonable accordance with Parsons’, and possibly by a theoretically more justifiable 
method than endeavouring to sex the bones above 44 and below 48 from other 
characters. 

We have considered from the same aspect the character breadth of lower 


articular end of femur. Parsons’ distribution of 89 female femora is as follows 


(p. 257): 


i | | 
q . ; Ay SPL yee ya ae llie a wh | yw Prpiliay re 
| Breadth in mm. ... | 60 | 61 | 62 | 63 | 64 | 65 | 66 | 67 | 68 69 | 70 | 71 | 72 | 73 | 74 
| | | | 
| 


a : SeEeat 


In this Table he has only one bone in excess of the numbers on which he 
bases his means on pp. 250—1. If we truncate at 69'5, Le. reject all bones over 
69 mm., we find 

2?=3'9803, d=2:9058, 
and pr, = "4714. 


Hence we deduce 
h’ =5295, We= 977, W,=1426. 
These lead to 
h=1:503, o=2:839, N=984, Mean=68-00 mm. 
The actual values given by Parsons’ distribution above are 
o=2571, N=89, Mean=67'54 mm. 

Thus the agreement is not nearly so good as for the diameter of the head ot 
the femur, being about 10°/, wrong ino and NV. It should give as good a result 
if the method were quite satisfactory, for the bones have been sexed by the 
diameter of the head, and the limit 44 mm. for diameter of the head corresponds 
fairly closely to 69 mm. for the breadth of lower articulation. 


As this paper is not intended as a discussion of Parsons’ data, to which we 
hope again to return, we will only deal with one more illustration of the use of 


212 Table of the Gaussian “ Tail” Functions 


the Table. We take out from his Tables, pp. 244—248, the diameter of head of 


femur for 174 male bones. 


ee 
Diameter of Head in mm. ... | 45 | 46 | 47 | 48 | 49 50 | 51 | 52 | 58 | 54) 55 
=: Teas (sae 
Frequency ... pb pee ||) c 33 | 17 38 | 20 18; 12; 6 | 8] 3 
| 


The constants of this distribution are 
Mean=49'14, o=2377, N=174. 
Truncating at 47°5 we have 
>? = 3'4341, d=2°7869, 
whence yw, = 4422, and from the Table 
h'='679, W.=°908, wW,=1°331. 

These lead to h=NT132 

and Mean = 49°22, o=2°530, N=162, 


Le. to a “tail” of 40 not one of 52 below 47°5. Actually this tail distributes itself 
as follows: 
Onder 45 45 46 Av 
Gaussian tail 5 6 12 17 
Against Parsons’ 0) 9 10 33 


This confirms our previously expressed view that probably a considerable 
number of the bones classed as 47 mm. are really female femora, and that the 
male distribution runs considerably beyond 45 mm. into the range treated as 
purely female. 

Finally let us try the result of pooling male and female bones and breaking up 
the composite frequency by the method of Phil. Trans. Vol. 185 A, p. 84. 


We have now 279 bones distributed as follows: 


ane ; | | ie | 
HER UES ser 36| 37 | 38| 39 | 40 | 41 ca 43 | 44| 45 ale 47 48| 49 |50| 51| 52| 53| 54| 65) Total 
ea ke 
oe = 
Frequency {1} 1]—/ 4 [10 Bee) 13/33/18| 3820/18/12! 6 | g|3| 279 
| 
The constants are Be = 46°63, s.D.=3°93, 


by = 15°4040, bs =— 67791, 
= 5415162, ws = — 5380°5339. 
The nonic is 
gq.’ — 596189," + '0689¢.5 + 9°83579q. — 3:°4275q.' — 8: 2041¢,? — 30209," 
+ :0144g, - 0097 = 0, 
giving the root q,=— 934 and p,=— 9°34, 


ALICE LEE 213 


and ultimately the two components : 


Male Female 
Mean ‘ 2 : : f 49°83 43°72 
Population : ; ’ 133°25 145°75 
Standard Deviation . d : 2B 2°662 
Max. Ordinate . . : : 23°83 21°84 


While the means agree roughly with those obtained by Parsons’ sexing (49 and 
43), we see that this analysis much more nearly equalises the number of male and 
female bones, and indeed makes the female population rather larger than the male, 
while Parsons has 79 °/, more males. The “truncated tail” method would probably 
give results in better accordance with the present had we not truncated at the 
quite arbitrary Dwight-Parsons’ divisions. 

These examples may suffice to illustrate the application of the Tables to 
anthropometric measurements on man, where we can feel fairly confident that 
the material, if sufficient in quantity, would be adequately described by a Gaussian 
or normal distribution. Such cases may arise when material for the two sexes, 
or for two races, is commingled and we can be fairly certain that one or other or 
both “ tails” of the material present homogeneous parts of the mixture, 


Another illustration drawn from Galton’s data for American trotters will be 
found in the Tables for Statisticians, p. xxvi. The chief weakness of the method, 
besides the assumption of the Gaussian, often quite legitimate, is the absence as 
yet of the values of the probable errors, which values must be very considerable 
for slender material such as that used above. 


See following page for Table of Gaussian “Tail” Functions. 


Biometrika x 28 


214 Table of the Gaussian “ Tail” Functions 


Table of Gaussian “ Tail” Functions, “ Tail” larger than “ Body.” 


W aT (—) Ay Yo (—) Ape Ps (-) Ay h’ 
F OF aDe “f) 
00 | BS |. 008 le Bee coogi mh 200) (| oer samme 
OL |) 860" 5 Oa aly 248 algae oan imn capes 01 
03 | °567 2 | 1-949 1-969 02 
002 006 : O15 
08. | penugl” SOEs" | soso eee maa Ose eg ee 03 
0, |) -5e4".| 22 | eBay Ia te len Osea one 04 
oo | bea" (|) 7Pe ly eebi emcee ell wipe ual sae 05 
06 || 2560, La lonm |) 1210 ae eal mtn weeny 06 
O7 i558 | oa || aie coer ee ay OV 
08: | 55m eee || q-bosmiaw 1880 | | 08 
09 | °BBB sae 1203)741), oan el E86 a 09 
“1 Cie te ate SIS? «lie 1-352 | “1 
2 535 oe 1143 As 1°726 Was 2 
3 DIG W/aore (C1000) | pee, eee eee 3 
4 497 | Cra |, 1040 |, poh aki eae 4 
of Avs d . 0 
2 cealenors Be ecw, op PSR | Snares io 
cioaadhe can re ae eves ee 6 
7 BE ae 899) || cocce || SteBLOnel uae 7 
9 oy, 4 “QR “Oo é 
2) | | S| Se | Ge | | 
1-0 380 | ‘O19 17 eee rises Le kg 
11 Br | elk FAO) | Ooh a alta a eee alate 
12 949 | (OTR S Srod yl 088) Siciso We en are 
E ‘018 ss 71 033 1-107 023 ry 
: i a | ae He Ph | acy PO | ‘4s 
15 88 | Ol? “10” || 080 S| jo7a 4) ClO, 
ze ‘O17 : 028 014 
ane eu 016 Be cin eae I nin |) Le 
v7 Doe ee Ce eerie) ep aces. ||. 27 
fe 4 4 “—° 6 
| 32] oe | S| Bs | tee | ae | 
2-0 =O alee 487 | 022 | 1-993 | 006 | 96 
2] ay ee 466 | 020 | j.o18 | 095 4 947 
a2 |--1g4 | Ols aap | LO i014 |e eles 
2-3 We alee A009) || 02085 alone adores 
ee eae ‘O11 as ‘017 1-008 :003 a 
oe 151 | 010 397 | 18 | ioe | 02 | a4 
Pye ae 010 ae ‘O15 ee 002 i 
26 Laie ieee ref tds | ee | oe 
"7 Teg | so8 369. | ota. | HROOBA IE ope alnne dy 
2°8 124 | - cos A eis | ean (Ree 
2-9 SECs | eae eee || leony | ear fe 
3-0 109 333 1-001 3-0 


See Zables for Statisticians and Biometricians, Introduction, p. xxvii. 


CONTRIBUTION TO A STATISTICAL STUDY 
OF THE CRUCIFERA. 


VARIATION IN THE FLOWERS OF ZLEPIDIUM DRABA LINNAUS. 


By JAMES J. SIMPSON, M.A., D.Sc. 


CONTENTS. 
Pact 
I. Introductory : ; : : d ‘ : : : . 215 
II. Botanical . , : : : : : : : ; . 217 
1. Specific Gharatters. : ; : : : : ; 5 aly 
2. Morphology of the Flower. ; . : : : 2 218 
3. Conception of Chorisis . ; : : : : : . 219 
4. Orientation of the Flower. : : ; : : . 220 
III. Examination of the Data . : F : ‘ : ; ; . 222 
1. Classification . : : : F : : : : . 222 
2. Analysis . 6 : : ‘ F , ‘ : ; . 223 
IV. Statistical . ; : : . 242 
1. Study of the cane and Semidand Deviations : ; . 242 
2. Study of the Correlation Coefficients . : : . . 245 
V. Morphological Significance of the Statistical Results : ; . 251 
VI. Variation in the Gynecium : : F : , . 255 
VIL Suggestions for Future Studies in ine Plane 257 


I. Inrropuctory*. 


In the summer of 1905 Professor J. W. H. Trail drew my attention to an 
extraordinary example of variation which occurred in the several organs of the 
flowers of Lepidium Draba Linneus. At that time I examined in detail 1832 
individual flowers taken from a single plant growing in a piece of uncultivated 


* Tam pleased to have this opportunity of expressing my great indebtedness to Dr J. F. Tocher for 
invaluable assistance in the biometric part of this paper. The correlation and other constants were 
calculated in his laboratory, and without his assistance the publication of this paper would have been 
greatly delayed. I must also thank Professor Karl Pearson, in whose department in University College, 
London, the statistical study was originally undertaken, for reviewing this paper for publication and also 


for much kindly criticism and advice. To Professor Trail my thanks are also due for many botanical 
hints. 


28—2 


216 = =©Contribution to a Statistical Study of the Crucifere 


ground in his garden, and the results of these observations form the basis of the 
present contribution to the study of the variation in the Crucifere. 


Botanical problems, which have been hitherto attacked from the biometric 
standpoint, have been comparatively easily handled, because the material has 
been more or less homogeneous in character. For example, variations in the 
number of sepals of Anemone nemorosa* or in the number of ray-florets of 
Chrysanthemum leucanthemuwm+, and the consequent distribution of these are 
capable of direct treatment by Pearson’s well-known method of fitting frequency 
curves, 


The only work comparable to the one in hand occurs in Biometrika, Vol. It. 
p. 145 (Variation and Correlation in the Lesser Celandine), but in this case the 
numbers of members in the calyx, corolla and androecium have been examined 
as a basis for a study of homotypic correlation and in this flower each of these 
organs consists of a single constituent with numerous members. 


The problems studied in this paper, however, are more complex inasmuch as 
they deal not with one organ of the flower but with all the organs, their con- 
stituents and members both separately and collectively. 


It is also, I believe, the first biometric work of its kind on a cruciferous flower 
and embodies a study of chorisis, that is, “the splitting up or division of one or 
more components of a flower into two or more equal or unequal parts ”—a factor 
which is supposed to have been of the utmost importance in the evolution of the 
natural order—Cruciferee. A complete discussion of this phenomenon is reserved 
until the flower is studied in detail. 


It would be well here to emphasise the fact that the flowers examined for this 
study were not taken from different plants but, on the contrary, were obtained 
from several inflorescences growing on stems which had arisen from buds on the 
roots of a single parent plant. This mode of reproduction is rather unusual, but, 
in the present instance, is of particular interest imasmuch as it gives greater 
homogeneity to the material. 

The parts of the flower which have been considered are (a) the perianth, which 
consists of (1) the calyx and (2) the corolla, (b) the andrcecium and (c) the 
gynecium. 

The functional differentiation of these organs is of great importance in the 
interpretation of results so that it might be well to recall the particular réles 
which these play in plant economics. 

The gynecium and the andreecium are respectively the female and male organs 


of reproduction and consist of carpels and stamens, while the perianth forms a 
protective covering for these delicate structures. The calyx or outer organ of 


* Yule, Biometrika, Vol. 1. p. 307. 
+ Biometrika, Vol. 11. p. 309 et seq. 


J. J. SIMPSON PANE 


the perianth is concerned solely in the protection of the flower in the bud, but 
the corolla, in the open flower, also serves, along with the honey-secreting sacs at 
the base of the stamens, as an attraction for insects. 


The characters which have been taken as a basis for this study are numerical, 
eg. the number of petals in the corolla, but no measurable characters, e.g. the 
length and breadth of the petals, have been considered, although, as will be pointed 
out later in connection with possible future studies in this flower, these characters 
might also with advantage be taken. 


The Crucifere, as an order, are usually regarded by botanists as being very 
definite in type and no observations have been recorded to show to what extent, 
if any, deviation from the recognised botanical floral formula exists, so that the 
main object of this paper was to determine the frequency of the variability of 
the parts of the various organs and constituents, and also the degrees of correlation 
existing between the organs themselves. 


The mode of observation is worthy of remark, however, as it might well be 
argued that if the flowers used for examination were fully “blown” deficiency in 
the number of parts might be due to post-developmental fracture, but in all the 
cases here recorded the observations were made on flowers in bud or only half open 
so that the influence of wind or other external agency is altogether discounted. 
The material was also examined microscopically in all cases so that there should 
be no possible doubt as to the exact origin of any member. The importance of 
this will be seen in the details of the analysis. 


II. Bovranrcat. 
1. Specific characters. 


The generic and specific characters of Lepidiwm Draba may be obtained in any 
complete systematic botanical work so that it is unnecessary to repeat them here, 
but a few notes bearing especially on the study in hand may be of value. 


It is a perennial about a foot in height and is covered by a minute down from 
which its popular name, the hoary cress, is derived. The inflorescence is a raceme 
not much lengthened and so forms a broad, almost flat, corymb-like termination. 
The individual flowers are small, white and numerous. The constituents of calyx, 
namely the sepals, are green; they are short, nearly equal and bear no pouch at 
the base. The petals are small and white; they are equal in size, obovate, 
undivided and generally stalked. The stamens are six in number; the filament 
is simple, 1.e. it bears no appendages, and is shorter than the petals; the anther 
consists of two roundish lobes. The pods are “ broader than long”; they are com- 
pressed laterally at rght angles to the narrow partition. The thick valves are 
boat-shaped and sharply keeled but not winged; each valve contains a single seed. 


218 Contribution to a Statistical Study of the Cructferce 


2. Morphology of the flower. 
The typical flower consists of six whorls, made up in the following manner: 
(a) Calyx 2. 
(b) Corolla 1. 
(c) Andreecium 2. 
(d) Gynecium 1. (Plate I, fig. 1.) 


(a) Calyx. This organ is composed of two whorls each consisting of two 
sepals. The outer pair arise at one level on opposite sides of the flower and are 
inserted on a slightly lower plane than the inner pair; they are parallel to the 
plane of compression of the gynecium. The inner pair are also situated opposite 
one another but in a plane perpendicular to that of the outer pair; they are thus 
at right angles to the plane of compression of the gynecium. These whorls are 
denoted on Plate I, fig. 1, by the Roman numerals I and IT respectively. 


(b) Corolla. This organ consists of four petals all inserted at one level and 
alternating with the position of the sepals; they thus constitute a single whorl. 
(See III, Plate I, fig. 1.) 


(c) Andrecium. Six stamens form the andreecium ; they arise at two different 
levels and thus constitute two separate whorls. The outer whorl, which is lower 
down, consists of two stamens which are shorter than the others and corre- 
spond in position to the inner sepals. The inner whorl consists of four stamens, 
arranged in pairs which correspond in position to the outer sepals. (A reference 
to the figure (Plate I, fig. 1) in which the two whorls are marked IV and V 
respectively will make this clear.) 


(d) Gyneciwm. This organ consists of two carpels forming the sixth or 
innermost whorl. (See VI, Plate I, fig. 1.) 


It will be seen from the foregoing description that the order of the six whorls 
here detailed is that in which they would be found were we to strip the flower of 
its components at the different levels consecutively from below upwards. It is 
also the order in which we would find them, passing from the outside to the 
centre, were we to cut a transverse section through the flower. 

Another point, however, which is not so obvious but one which has special 
interest in our study, is the fact that this is also the order in time of development. 

The actual sequence in which these constituents of the flower appear in the 


bud is therefore : 
I. Outer whorl of Calyx (Sepals). 


II. Inner whorl of Calyx (Sepals). 

III. Corolla (Petals). 

IV. Outer whorl of Andreecium (Stamens). 
V. Inner whorl of Andrcecium (Stamens). 


VI. Gynzcium (Carpels). 


J. J. SIMPSON 219 


3. Conception of Chorisis. 


Chorisis or reduplication is generally looked upon by botanists as a means of 
multiplication of the parts of a flower. It consists in the division or splitting 
of an organ in the course of its development by which two or more organs are 
produced in place of one. Chorisis may take place in two ways: 


(1) transversely—when the increased parts are placed one before the other, 
that is, the resulting components are on the same radius ; this is known as vertical, 
parallel or transverse chorisis ; 


(2) collaterally—when the increased parts stand side by side, that is, on the 
same circumference. 


Transverse chorisis is supposed to be of frequent occurrence ; thus the pistils 
of Lychnis and many other caryophyllaceous plants exhibit a small scale on the 
inner surface at the point where the limb of the petal is united to the claw. The 
formation of these scales is supposed by many to be due to the chorisis or unlining 
of an inner portion of the petal from the outer. 


Collateral chorisis is seen in different natural orders. In Strephanthus, in place 
of two stamens there is sometimes a single filament forked at the top and each 
division bears an anther. This is usually supposed to be due to collateral chorisis 
arrested in its progress. 


The flowers of the Fumitory are also generally considered to afford another 
example of this type of chorisis. In these we have two sepals, four petals in two 
rows and six stamens, two of which are perfect and four more or less imperfect. 
The latter are said to arise by collateral chorisis, one stamen being divided into 
three parts. 


Collateral chorisis may be compared, according to Bentley, to a compound leaf 
which is composed of two or more distinct and similar parts. 


Let us now consider chorisis in its bearing to the flower under consideration. 
In the description of the morphology of the flower we noted that in the inner 
whorl of the andrcecium there were four stamens arranged in pairs while in the 
outer whorl there were only two stamens situated singly. Various opinions have 
from time to time been advanced to explain this anomalous structure so that it 
might be well to briefly review these. Of the andrcecium of the Cruciferze Oliver 
says: “The two pairs of long stamens are generally thought to be due to chorisis 
or the division in the course of development of single antero-posterior stamens. 
Others have thought that the six glands represent abortive stamens and that these 
with the six stamens make up a normal series of twelve in three whorls.” 


De Candolle held the view that the stamens formed a single, originally 
tetramerous whorl alternating with the petals in which the median members, 
Le. the anterior and posterior, were cleft (chorised) in two. Since however the 
lateral stamens are inserted lower down than the median stamens and are also, 


220 Contribution to a Statistical Study of the Crucifere 


as already pointed out, formed earlier in the bud, this view is clearly untenable. 
Two whorls must be taken into consideration owing to the difference in the levels 
of insertion, the single stamens being lower down. Kunth, Wydler, Chatin and 
others regard these two whorls as typically four-membered (tetramerous), those of 
the outer whorl corresponding in position to the sepals, those of the inner whorl 
corresponding in position to the petals. To arrive at a typical cruciferous flower 
from this, two stamens in the outer whorl abort, while the individuals of the two 
pairs of the inner whorl come together. (Plate I, fig. 3.) 


Others (Krause, Wretschko and Duchartre) regard the outer whorl as typically 
dimerous (i.e. with two constituents) and the inner whorl as typically tetramerous 
(i.e. four-membered). 


The more modern view, however, regards both whorls as dimerous but the 
inner one chorised collaterally thus giving the typical cruciferous flower. 


The reasons put forward to support this theory are as follows: 


(1) The upper long stamens are usually paired in the median line, also 
sometimes coherent. Further, in place of one or both of the pairs, there occurs 
sometimes a single stamen—a hint at reversion, or one or both pairs may be 
replaced by three or more—a suggestion of further chorisis. 


(2) In the earliest visible stage of development in the bud it may be seen 
that each pair of stamens arises from a single wart-like projection and that division 
is therefore a secondary result. This is not very easily demonstrable in the 
Cruciferee but is more evident in a closely allied family, the Capparidacee. 


Since the present study includes numerical variation in the different constituents 
and positions of the andrcecium it will be interesting to note to what extent any 
one of these theories is borne out by the variations in this flower. 


4. Orientation of the flower. 

Having defined the positions of the various stamens relative to one another, 
in what is usually regarded as a normal cruciferous flower, let us now consider the 
different possibilities when the flower is abaormal. 


Suppose that one of the pairs of stamens of the inner whorl is represented by 
a single stamen, that is, suppose that chorisis had not taken place. Now with 
regard to the peduncle of the inflorescence this stamen might be placed in two 
diametrically opposite positions, namely (1) it might be adjacent to the peduncle 
(Plate I, fig. 4) or (2) it might be on the distal half of the flower with reference 
to the peduncle (Plate I, fig. 5). 

Two questions now arise, (1) do non-chorised stamens occur as frequently as 
chorised stamens on the side of the flower next to the peduncle? or (2) do either 
of these occur with greater frequency in this adjacent position ? 

According to which of these questions is answered in the affirmative must we 
conclude whether there is any connection or correlation between the proximity of 


J. J. SIMPSON 221 


the chorised stamens to the peduncle and chorisis. The former would suggest no 
correlation, whereas the degree of correlation hinted at by the latter would depend 
on the frequency of the occurrence. 


We have so far considered only two possible positions, viz. a non-chorised 
stamen adjacent to the peduncle, i.e. in the proximal half of the flower, and a 
non-chorised stamen opposite to the peduncle (i.e. in the distal half of the flower 
with reference to the peduncle), but the question naturally arises “ Are these the 
only two possible relative positions which might occur?” Might the petiole not 
twist so as to bring the hypothetie non-chorised stamen into any position varying 
from 0° to 180° with reference to the original plane ? 


Let us illustrate this by means of the Figure 6, Plate I. 


Taking the position of the peduncle as our fixed point the non-chorised stamen 
might occupy the “adjacent” position a or the “opposite” position al. A rotation 
of the petiole, however, might cause this stamen to occupy any of the positions 
marked a2, a3 or a4 or even any intermediate position between a and al on either 
side of the vertical plane A—B, in the horizontal plane a, a4, a2, a3, al. 


In a study of the variations in this flower, this is precisely what was found to 
occur, i.e. the distribution was equal round a fixed point so that we are unable to 
say whether there is any connection between the proximity of the non-chorised 
stamen to the peduncle and chorisis or not. 


But the full bearing of this consideration does not end here. The orientation 
of the flower is of practical importance in fixing a basis on which to establish 
a grouping of the different variations. Any analysis of the data is impossible 
unless some definite part of the flower be agreed upon as a starting point. 


Now we have seen that the position of the peduncle with respect to any 
definite stamen does not require to be taken into consideration. Consequently 
we may take either of the two stamens of the outer whorl, which correspond in 
position to the outer sepals and which are “normally” non-chorised, as our fixed 
point and call it 1; the stamen opposite, i.e. in the same whorl, we shall call 2; 
the chorised pair of the inner whorl to the left (or in the floral diagram above) 
may be termed 3 and 4; while the corresponding pair to the right (or in the floral 
diagram below) would thus be 5 and 6 (Plate I, fig. 7). 


Where variations occur in any of these stamens we shall hereafter refer to 
those as occurring in “position” 1, 2, 3, 4 and 5, 6 respectively. 


On this basis of symmetry, it will simplify matters considerably if we regard 
as 1, in flowers in which either of the two outer stamens is modified, that one 
which still maintains its original character while, on the other hand, if both are 
modified, that one which retains the greatest approximation to normality, e.g. if 
one be chorised while the other is not, the latter would be in position 1; or if one 


Biometrika x 29 


222 


Contribution to a Statistical Study of the Crucifere 


were chorised while the other was only partially chorised* the latter would again 
be in position 1. 


Following on this it is at once seen that where both are normal or where both 
are equally abnormal it makes absolutely no difference which position we choose 


as. 


TEI: 


1. Classification. 


EXAMINATION OF THE DATA. 


Considerable difficulty was experienced in classifying the variations owing to 


these occurring in so many different forms yet with so few characteristics in 
common as to warrant their inclusion in definite classes. 


The total number of flowers examined was 1832, of which 1062 had the 
accepted normal structure (see page 218). The remaining 770 showed variation 
in different degrees of advance or regression, i.e. there was an excess or deficiency 
in the number and structure of the members of the various organs. Thus we see 
that there was a deviation from the accepted normal structure in over 42 per cent. 
of the individuals examined. 


The perianth has been selected as a basis for classification and Table A shows 
the sub-divisions which have been adopted. Amongst those flowers in which the 


TABLE A. 
Number Number Number Number | Variations 

of in in in in the 

Variations Group Sub-class Class Class 
Class I. Perianth normal hele — — — 1687 = 
Sub-Class A. Gyneecium normal -— _ 1680 — a 
Group (a). Andrecium normal 1 1062 — — — 
Group (6). Andrecium abnormal 57 618 — — — 
Sub-Class B. Gynzecium abnormal — -- 7 — = 
Group (a). Gyneecium one carpel 2 4 — — = 
Group (6). Gyneecium reduplicated 2 3 —_ -- 62 
Class II. Perianth abnormal at ee be — — os 115 — 
Sub-Class A. Calyx normal, corolla abnormal — — 55 — ~ 
Group (a). Gynzecium normal iil 54 - _ = 
Group (0). Gyneecium a single carpel : 1 — = 
Sub-Class B. Both calyx and corolla abnormal — — 60 — = 
Group (a). Gynzecium normal : il 46 — — = 
Group (b). Gyneecium a single carpel 6 14 — — 29 
Totals 91 1802 1802 1802 91 


* For the present we use the terms ‘‘chorised” and ‘‘chorisis” in the sense of the definition already 


given. 


J. J. SIMPSON 223 


perianth was normal there were no fewer than 62 different types of variation, and 
amongst those in which the perianth showed a departure from the accepted 
normal structure there were 29 types of variation. Thus of 1802 flowers examined, 
1062 had the typical cruciferous structure, 625 had the perianth normal but the 
andreecium and gynecium modified in 62 different ways and 115 had all three 
organs modified in 29 different types of variation. 


The remaining 30 individuals are not capable of classification under the fore- 
going scheme but have been grouped into three classes as shown in Table B. 


TABLE B. 
Number Number 

of of individuals 

| Variations | in the Class 
Class III. Reduplication of parts but flowers not separate ... 10 11 
Class IV. Reduplication of parts with flowers separate... 6 17 
Class V. Part of a flower replaced by a flower see oe 2 2 
Totals... se ae a ae aes oh 18 30 


Altogether, therefore, there are five separate classes which give a total of 109 
different modes of variation. 


2. Analysis. 


In the further reduction of the data it is essential that we consider the 
variations in the stamens, and for this purpose we must naturally commence 


with Class I, Sub-class A. 


To avoid describing each of these in detail, it is necessary to have recourse to 
a graphic method of representation. Several such methods suggested themselves 
and although none are ideal we have chosen one which may help to give a true 
impression of the various modifications assumed by the andrecium. We shall 
also give a few examples by another method which might have been adopted but 
which seems to us to be even more complicated. 


Let us, in the first place, consider in what directions abnormalities have 
occurred. A typical stamen consists of two parts, (1) the filament and (2) the 
anther. 


(1) Filament. This may be of its normal length or less than its normal 
length or altogether absent. 


(2) Anther. This may be present or absent. 


But other complications arise. As already explained, in the accepted typical 
cruciferous flower, chorisis has taken place in positions 3.4 and 5.6 so as to give 
29—2 


224 Contribution to a Statistical Study of the Crucifere 


rise to two stamens in each of these positions. Now, we find that, in certain 
flowers chorisis has only partially taken place and in others it has not occurred 
at all so that we have thus another three possibilities to consider. 


In describing the androecium, therefore, we must (1) define the position of 
each stamen to which we refer, (2) state the nature of the filament, (3) note the 
presence or absence of the anther and (4) emphasise the nature of the chorisis. 


Let us use the following symbols 1, 4 and 0. 


Filament, indicates that it is present and complete. 
1 with reference to 4} Chorisis, indicates that it is total or complete. 
Anther, indicates that it is present. 


jeulaetent, indicates that it is only half-length. 


4 with ref oe Suleesee se : 
g Wim retenence to (Chorisis, indicates that it is ouly partial. 


( Filament, indicates that it is absent. 
Chorisis, indicates that it has not taken place. 
Anther, indicates that 1t is absent. 


0 with reference to 


We have already fixed upon our nomenclature for the various positions; these 
are 1; 2; 3.4; and 5.6. To avoid descriptions and at the same time give a 
graphic representation of the floral formula of the andreecium the following 
system might be adopted: 


(1) Place the whole floral formula within square brackets thus [ _ ]. 
(2) Place positions 1; 2; 3.4; and 5.6 within curled brackets thus { }; and 
(3) Place individuals, i.e. 1, 2, 3, 4,5 and 6, within rounded brackets thus (_ ). 


Expanding this with reference to a normal flower we would have for the 
andreecium only 


[12} {2} {2)], 


or still further in the order of Filament, Chorisis, Anther, Stamen 


(1.071) G.0. 1) (1.1), Ge ea Gea i 


Or, taking an actual example from our data : 


Stamen number 1 is normal and complete, and there is no chorisis; stamen 
number 2 has a filament only half-length but the anther is present and complete, 
and there is no chorisis; stamen number 3 is normal and complete; stamen 
number 4 is only half-length but with a complete anther—chorisis between 
3 and 4 is complete; stamens 5 and 6 are only half the normal length but have 
complete anthers—chorisis between 5 and 6 is complete. 


This would be represented thus: 
(ia.0.1) 0% DGD) iG eae a) Geel (a sea) 


J. J. SIMPSON 225 


Another graphic method and the one which we have adopted is as follows. 
Each flower is represented in a table similar to the following: 


Number of 


Diagram Stamen Filament Chorisis Anther 


Frequency 


The first vertical column gives the frequency of the variation or the number 
of individuals examined with this structure. The second vertical column gives 
the number of the corresponding diagram in the plates. The third vertical 
column gives the individual stamens in the positions already defined while the 
other three columns denote the various factors to be considered. The different 
possibilities of variation in these may be shown by the symbols 1, 4 and 0 as 
already defined. It should be noted, however, that in positions 1 and 2 a dash 
(—) will be placed in the chorisis column to indicate that these are typically 
non-chorised stamens and that absence of chorisis does not therefore indicate 
abnormality. Representing the same example as before, by this method, we 
would have: 


Frequency eee Stamen Filament | Chorisis Anther 
1 1 — 1 
2 s = 1 
3 1 1 il 
4 4 1 1 
5 i 1 1 
6 4 1 1 


The following table shows graphically the types of variations illustrated in 
Figs. I—LVIII, ie. Class 1, Sub-class A, flowers in which the perianth and 
gynecium are both normal. 


It will be seen that in the flowers illustrated in Figs. XLVIII—LVIII another 
complication has crept in. Stamens 8, 4, 5 and 6 have themselves sometimes 
undergone partial or total secondary chorisis. In the tables, therefore, by sub- 
dividing the squares containing the details we can thus adhere to our initial 
nomenclature. Let us take the three most difficult examples to illustrate this. 


(1) Fig. XLVUI. The division corresponding to stamen 8 is sub-divided. This 
would indicate that in this position there were actually two stamens, The nature 
of each of these individual stamens is, as before, given in the sub-divisions. In 


Contribution to a Statistical Study of the Cruciferae 


226 


TABLE C. 


ToYUY 


Free | fmm at Ct | 


ad et ae reed Fl me 


Ls te ie en 


Saenn es 


aoe 


ant Ono 


meOnsa AS 


SISTIOYS) | |=so0 | |aaae | |aHos | |azunoo | |oooo | loess lelisisters | |aaae 
qUoULe ll Lo ee ee SS indin Lo ee ee eon >) SS itn iiaen lien linia! Seas HO Ss OHNO aie lee | 
Lecleeueatel 
mA Hd mA OH 1d mn Hud 6 mG O Huw 6 mM Hid 6c ra Go SH 1d 6O mA Hid © ram OD Hud 6 
jo qoquinyy 
Q, om lal > 
— 
Weisel iad ES LA I > HH bd ba 
jo soquany = 4 x be > ba Ss 
al a 
) 
Aouonboay oO oo t 00 o + a mo 
+ 


JoyJay on aes HO aaa aeaee eS aoa tn oe oe ce ee aor OnO aaa 
SISILOYD) | | Aa didn | poaeae | | HHOO | paaee | CS) j |oooo | ES solo Jaane 
JUOTAC TL [eee ee oe ee IN See HHO IHN SS A HOHNO ts en I oe a ono mame OrO Sd a oS 
uatey}9 | 
mM & aH 1d 6 mao Hid mao Hud © mA +H 12d mA Hid © mA) 19 mG oO stm 6 HAN OD Hd 
JO TOq UIT NT 
Q, - a lam am 4 
weASeIC, es = - = i > = 
jo qoquinyy | > 4 bd oA a rs 
, MN bd 
I~ 
Aouenbea a = a =i oO = = =) 
nN N Lal 


ToyJay fo al es ft mee oe ee ee niles edie ieee! lito oon oe aoe HO Ses Me ee oe el eer Onn Cai en ie ie ie! 
eco meee eo oe ioe) feos ihe) iesees| Gisceen pees 
eee pon ae a SHIN aan a i ai Saeed CO SS Hin See Odea me AAAS 
UsTLeYS z ie a - 


jo roquinyy 


mG oD st im 6 


mG. OD StH 1d 


maa oO Hid © 


TBIsRIC 
jo raquinyy 


I 


VII 


XVI 


XIX | 


XXII 


fouonbed,y 


1062 


38 


2 


2 


J. J. SIMPSON 


TABLE C—(continued). 


Sees HO 


SeOnsa eS 


Ses SSeS 


woyqUy ee See aS a See ee 
| 
SISIOyD | |e Hoo | | aaa | |paunoo | | aaa | |auHoo | | aaae | | aaa | |aaeeaee 
QUOTAVT TY SANA RS mS eS in re HN Hae © Rae SS SHINAI RI O qeHOn saan HNO A SaaS KH eae eR Re 
uoUIeIS “ é é , 
rma GN oD Hud € ma oD Hun moIAN oO Hid mA OO Hd rma OO Hud © mA OD Him 6o mW O Hid oN MO HTH OO © 
jo OQ TUN AT 
tH 
en b om FS 
ae 5 s IIIXXX IAXXX XIXXX . 4 ILIATX 
Jo qoquiny “A 2 aah AA PN oS? tA tA ’ 
iN r d 
A : _ 7 as : 
Aouonbaty ar) 4 ry cor) a 4 a 09 


wayyy 


foe ee oes ee Biel oe 


ae Aaa 


Se HOHO 


ae 


sone sO 


SISLIOYO 


| Jeaae 


| |auoo 


Jama 


1S... 


| [aaa 


| joa 


| Jaane 


| |nnoe 


quowUe [Ly 


mS ina 


aR a AIN© 


mt eS AAA 


ry AIAAING HINO 


INGA Re HINAaI|oa 


Ss On eee 


me OO se AAR 


AIO aS eS RINO 


UdTUR]S 
JO TOQTUAN NY 


mG) SH id 6O 


NN 1) Hid © 


NOD SH 10 0 


my GI) SH 1d © 


mG OD Hud © 


mA OO Hid 


aN Hd © 


ITXXX 


JIIAXXX 


XLIV 


XLVII 


a ee bd 
WIeISVICT a iS 
jo aaquam Ss ~ 
“ “ 

Aouenbaa yy a on 


| 


BHOnaae 


3} 


es Onr HO 


TIQIUY Sonia ieee tiene ie! see HO Sr Sena HO ee | mOn AS 
SISLIOUY) |oaae | [| aHooO | HH0O | | HHOoO laa | jaa | Hn oO | | HHoO ad 
f 
JUOUe TL IAS Ss in sda HO ta ic oS a AladiaO RIN SICIR a  in re OnrnaH HOnm RANA IO eA AHO 
WEUIey}G 9 69 +H 1m 6 Ho Hid © 
jo zaqmny mao 10 co mA Hid © mA Oo HD mA OD Hud mM” oO} Hid 6c MN oD Hid ce Seno 1 on 
| 
tH = 4 

urease iS 4 Sy = > 
jo aaqman yy 4 TITAXX “4 INGIPXOXEXG TIAXXX tA nS = 

ms 4 ~ ey 
Aouonbary ~ rt 0 ar) na 9 a) 4 


ferce 


t 


Contribution to a Statistical Study of the Crue 


228 


TABLE C—(continued). 


loyyuy 


talib) hl! 


tt tl ia tl tela 


SISIIOYO | |aroor | |SSseoe 
JVUOUTAe TT Cs es ee Be oe Cn is oes ee ee 
ueureyg 
aa st 10 aN tH 1D © 
jo qaquinn 
| 
WRIseIg = = 
jo qaquiny | A =| 
| 
Aouenbary. 10 4 
Toyquy Loon lites Mien len en een ee itn Bien ee eee 
SISTIOY | |aaenoO | | OOnaAs 
QUOULe TTY I See ae 
TOUIeyG 


JO Toq Un Ny 


UIB.LORICT 3 = 
jo zaqunyy 5 
Aouonboty 9 a 
TIGA Lelie en endl oe ee eel a 
| 
| 
sismoyg | | | | Aa cmaieictmicn 
| 
JVUOULe LY See ea SS ea 
uaTIeyS 


JO JOQ wan py 


UUIBLoRIT 
jo roquinn 


Aouonbar gy 


TABLE C—(continued). 


yay Sen One ot 
SISTIOYL) | |oOORn AA | |pnoorn 
JUOULC TL Sen Ona ROnnR ASS 
Goutes aN ost 1D CO Pano st 10 
N au 
jo zoquinn 
3 om o 
UIRIOvICL s = 
jo qoquiny ‘S) = 
4 
Aouonbea gy st a 
Loyqay (nt oe en en ee en Le en A oe en ee 
SISLIOY() | \Iesio.or | [pase 
JFUOULVIL Lo on oon ts oe Co (eS oe en | 
eats) mao st 19° 6 NOH 19 € 
3 = ac al 0 
jo coquinn im 
UWIRISVI(T = = 
o caquin am 
| POR LOC LUNG 5 
| Souonbory n 4 


J. J. SIMPSON 229 


this example what really occurs is: There are two individual stamens each equal 
to the original length and bearing a complete anther and separated from one 
another by secondary chorisis. 


(2) Fig. L. Divisions 5 and 6 are sub-divided to show that there has been 
secondary chorisis in both of these stamens. In the former, chorisis has been 
complete but has resulted in one being full-length and with a functioning anther 
while the other is only half-length with a functioning anther. In the latter, 
chorisis has not been complete inasmuch as only the anther has been chorised. 


(8) Fig. LIT. Divisions 5 and 6 are both sub-divided, consequently we may 
infer that both of these stamens have undergone some stage of chorisis. In the 
first column we see that all are full-length, in the third that all have functioning 
anthers, but the second tells us that chorisis has been only partial in each case. 
The symbol } between 5 and 6 indicates that between these two chorisis has also 
been partial. Thus we conclude a state of affairs as follows: In position 5.6 
(1) there arises a single filament which divides into two at some distance from 
the base and (2) that each of these again sub-divides and (3) that on the end of 
each of these four sub-filaments there arises a functioning anther. The others 
may be worked out in a similar manner but a reference to the diagrams will at 
once obviate any misrepresentation. 


From the foregoing table and illustrations it is evident that further classi- 
fication is possible but it would be well to point out here certain difficulties 
which arise. As an example let us consider such a case as (using our original 
terminology) that in which, in any of the positions (1; 2; 3.4; or 5.6), the stamens 
are represented thus (1.1.0) (0.0.0), thus ($.1.1)($.1.1) or thus (4.0.1) (4.0.1). 
Which shall have precedence? If we are to consider these variations as deviations 
from the usually accepted normal cruciferous flower, then we may safely assume 
that that flower which has the greatest number of functioning parts in a certain 
position is less aberrant than one in which any or all of the parts are altogether 
wanting; while, on the other hand, if in a position in which chorisis normally takes 
place, we have defective groups like those in cases 2 and 3 cited above, in one of 
which chorisis has taken place but not in the other, we must consider that group 
in which chorisis has occurred as being the one less removed from normal. On 
this basis then the above examples would be placed in the following order with 
regard to normality : 


GaGa yd. bl) (2). G.0.1)G.0. 1); (3) (1. 1.0)(0..0..0). 
Similarly for any of the others. 


Consequently we are now in a position to classify the actual cases under 
observation. 

So far we have considered only those flowers in which there was the typical 
number of stamens, with their manifold variations in size and structure, but now 


Biometrika x 30 


230 Contribution to a Statistical Study of the Crucifere 


we must classify those in which secondary chorisis has given rise to more than 


the accepted number. 


~ 


Let us take stamens 5 and 6 as our basis, ie. those individuals in which 


position 5.6 is occupied by more than two stamens. 


The relative frequencies of the different types of variation in the andreecium 
in the 618 specimens so far considered (see Table C) are very interesting. 
The number 1062 in Table F refers to 1062 flowers in which the andreecium was 


TABLE D. 


Number of 
Variations in 


Number of 
Individuals in 


: Sub- 9 Sub- 
Section group Section group 
Group a. 
Whole of the andreecium normal. Fig. I — 1 — 1062 
Group 6. 
Andrecium variously modified ... 
Sub-group a. 21 = 480 
Outer whorl normal (5 and 6 variously modified) — — — = 
Section 1. 
Stamens 3 and 4 normal. Figs. [I—IX 8 — 419 — 
Section 11. 
Stamen 3 normal, 4 represented thus (4-1-1). Figs. X—XIV 5 _- 25 — 
Section iii. 
Stamens 3 and 4 thus {(1-0-1) (1-0-1)}. Figs. XV—XVIII ... 4 — 9 — 
Section iv. 
Stamens 3 and 4 thus {($-1-1) ($-1-1)}. Fig. XXII 1 — 2 — 
Section v. 
Stamens 3 and 4 replaced by one. Figs. XIX—XXI ... 3 == 25 — 
| Sub-group £. 13 —_ 54 
| Outer whorl represented thus {(1-—-1)(4-—-1)} —- — = = 
| Section i. 
Stamens 3 and 4 normal. Figs. XXJJJ—XXIX 7 _— 42 — 
Section ii. 
Stamens 3 and 4 thus {(1-1-1) (4-1-1)}. Figs. XX X—XXXIV 5 — 11 — 
Section ili. 
Stamens 3 and 4 thus {(§-0-1) (0-0-0)}. Fig. XXXV ... 1 — 1 = 
Sub-group y. 
Stamens 1 and 2 represented thus {($-—-1) ($-—-1)} ... — 4 = 16 
Section 1. 
Stamens 3 and 4 normal, Figs. XXXVI and XXXVII 2 — 11 — 
Section ii. 
Stamens 3 and 4 thus {(1-1-1) (4:1-1)}. Figs. XXX VIII and XX XIX 2 _ 5 = 
Sub-group 6. 
Stamen 1 normal, 2 absent ... — 5 = 29 
Section i. 
Stamens 3 and 4 normal. Figs. XL—XLIII 4 — 27 — 
Section ii. 
Stamen 3 normal, 4 thus (4-1-1). Fig. XLIV ... 1 — 2 — 
Sub-group e. 
Stamen 1 thus (4:—-1), 2 absent. Figs. XLV—XLVII — 3 — 6 


J. J. SIMPSON 


231 


TABLE E. 
Number of Number of 
Variations in | Individuals in 
Section eee Section oa 
Sub-group 7. 
Stamens 1 and 2 are normal — 9 — 30 
Section i. 
Stamens 3 and 4 are represented by three. Fig. XLVIII 1 = 3 —- 
Section ii. 
Stamens 3 and 4 are normal. Figs. XLIX—LI 3 —- 16 — | 
Section iii. | 
Stamens 3 and 4 represented thus {(1-1-1) ($-1-1)}. Fig. LIT 1 — 2 — | 
Section iv. ; | 
Stamens 3 and 4 thus {(1-0-1) (1-0-1)}. Figs. LITI—LV 3 = 5 — 
Section v. 
Stamens 3 and 4 represented by one. Fig. LVI 1 == d — 
Sub-group 6. : 
Stamens 1 and 2 thus {($-—-1) ($-—-]}}. Fig. LVII —- 1 — 1 
Sub-group x. 
Stamen 1 normal, 2 absent. Fig. LVIII —_ 1 — 2 
TABLE F. 
Frequencies more than 3 in order of magnitude. 
(References have been made to the figures.) 
Figure Frequency} Figure Frequency Figure Frequency 
| 
I 1062 XXVIII 11 XXV a 
VIII 227 XXXVI 9 VII 6 
III 130 VI 8 x 5 
IV 38 XII 8 XLII 5 
XX 21 XIII 8 LI 5 
XL 18 XLIX 8 XVIII 4 
XXII 15 Ix a LVI 4 


normal. 


Where variation occurs, the greatest frequency, namely 227, occurs in 


flowers in which one of the pairs in the inner whorl is replaced by a single 
stamen while the next highest frequency, namely 130, occurs in those flowers in 
which partial chorisis has taken place in the inner whorl of the androecium. 
Following this the magnitude of the frequencies diminishes rapidly. The next, 
namely 38, occurs in flowers in which nearly all the parts of the androecium 
are modified while, near this, is the frequency 21 which exists in flowers having 


only one stamen in each position. 
we find that stamens 1 and 2 are involved. 


In the next two frequencies, namely 18 and 15, 


30—2 


232 = Contribution to a Statistical Study of the Cruciferce 


From these raw data we can see that the inner whorl of the andrceecium is the 
whorl most subject to variation and further that this variation is in the direction 
of a decrease in number. 


TABLE G. 
Class I, Sub-class B: Perianth normal, Gyneciwm abnormal (see Table A). 
saleseey Seales ae me 1S ob ede 
et eS lie valle tes Willers eae Perse) Were eb |) ee tae il ae i) 3 
5) os og 2 ne s a) Ons, lone 2 Z a 
2) 2h |28| 8 | S| S72) 26 (48/8) 5] s 
o| 82 (dela |S ) Sele ee ee Sake eee 
| etal, a Bran sea |e 
| 21) E 
| 1 1 — 1 1 1 — 1 
2 1 — 1 2 1 — i 
‘ - 3 1 1 1 ‘a 3 1 1 1 
¢ Pes 4 1 1 1 ; Ibs 4 1 1 1 
| 5 1 0) 1 5 1 1 1 
6 0) 0) 0 6 4 1 1 
l 1 1 1 1 1 _— 1 
ep tenis Ooa|ot )|s na 
2 1 1 1 3 1 1 1 
; 1 1 1 4 1 1 1 
: SoM. Ona lpeil alee ee i. | lentil 
1 SET 4 i 0 1 2 | LXITI 5 l 1 l 
1 1 1 6 1 1 al 
5 1 Lele 1 1 1 
1 1 1 
@ aa Oak a 


Class II: Perianth abnormal. 


Variations in the members of the perianth (calyx and corolla) have necessitated 
the introduction of new symbols in the diagrams. These are shown in the com- 
posite diagram Plate I, fig. 10, and are explained on p. 257. 


It will be evident from Table H, p. 233, that the same type of variation in the 
andreecium occurs with different types of variation in the perianth, e.g. in the 
second and fifth figures no fewer than six different variations in the perianth 
accompany a single type of variation in the andrcecium. Reference to the 
diagrams in the plates will show what these variations are and will render a 
detailed explanation unnecessary. The asterisk in Table H under LXXVIII 
indicates that there has been adhesion between stamen 1 and one of the 
stamens in position 8.4, in other words between one of the members of the 
outer whorl and one of the members in the inner whorl. 


Class ITI. 
The members of this class are characterised by a reduplication of the various 
organs but without separation into two distinct flowers. There are in all 11 


individuals with 10 different types of variation. A word of explanation is 
necessary with regard to the interpretation of the position of the various stamens 


233 


J. J. SIMPSON 


TABLE H. 


Class IT. 


IaqQuy 


SISTIOU/) | Jaane On saa Ornate | | HHOO | JaaaR 
JUOUeT LA Bae i en ROn RA A qaIO eS SSS Se een eS me 
U9TBYS 
Jo Taqman Ny rae” O stud © rN tid © mA Hid © mG OD Hud 6 mM Oo tid 6 
: 7 - 7 — 7 a ; at 
iS ial = 
4 | (ct i lool 
El rbd | > i > 
TWRASVL(T ie sis | rd S “4 
O Joq Un Ny 5 od K | q q r 
seats - re | bd < | 
| 4 4 iS 
Kouonbot gq 4 ao a 4 ar) 


ee 


TIQIAY BS SSS Sieh! HOnnHO CORR eA Ss SAHnROHO 
SISLIOY() | Jara | Ce 4HH |onmoo | | aaa ae | |ooos 
qFUOTAV TTY [on en es en ee HOnnHH BHOnHHO COOn nA HAH SseqeHOHO 
UOUIeYS 
mn Hid co mG oH 1d 6 ra Oo Hud aA SH WO HA OD 19 6) 
Jo oquUINN 
: = te 
eaeten! > Sey = = 4 
a lool lous Il > =! 
TWRISeIT Matnlsl ote ea eal bo Fe be be 
bd bad Atisiilets) bd sf ( 
joaoquny | KM OK SC I IOISIOn : Co bd iS 
F etter | 4 3 PS bd PS bl LA rbd iS 
et | 4 SoH ei 4 4 
4 4 4 
| | aa 
Aouonbery pees eb ona re ~ ati ay 


I3Yquy aA ODO tre emal et Tire) AON sae OO nan aaa HO 
SISLLOY) | HarIND OOO | | HHOO One aA A | Jaa | | xHoO 
Quogue TY re en) eee ee) oO AnAARaRA SCOn nse ile) 
| 
se tehesuc 
BONES) aN MO H19 aA H 19 rN 0%) +H 10 6 mn St 19 0c mm OOD tH 19 © 
jo raquin yy 
| = a 
ey 
= 
acie . ig = 
WeASVIy Ss Site wn BA > 
jo qaquan yy 4 Hr Ss 2 le 
4 S 4 
S 4 
Souenbeaq om OAH = 4 0 


The stamens in the outer whorl are longer than those in the 
Consequently if these were reduced in length they might easily be 


mistaken for members of the outer whorl. 


in these flowers. 
inner whorl. 


In all cases of difficulty, however, the 


crucial test, the point of origin, was applied and the positions assigned to the 


various members as shown in the diagrams were determined microscopically in 
See Table I, p. 234. 


this manner. 


TABLE I. 


Class IIT. 


234 


Contribution 


to a Statistical Study of the Crucifere 


ToyIUy Ls en A oe cn Be Be Le Ae oe Be Ln es oe ee ee ee 
SISTLIOUL) aes Ae HnOO ANA Ar ae aes wae | ee! aa 4 
JVUOULET TY E> ee De Le os Le | Sees As SS 
UdTUeYS 
jo qoquin jy La! nN fan) st awd we} La NO Ho Ne} aN SH 1d We) 
tH HH 
UIVISVICT > a 2 
o Jaquin o ‘S) 
JO 1oq TUN y ma 
Aouonbo.ty = 4 - 
| Toya y Ss ee oe Ln en es nn ce ne 
SISTIOU/) Se Ane ARS AR OO Ae Ses en en en en ee oe ee 
qVUOUMETTY Ca i A | tela feel Sool) ih Geli} abel 
ue UTS N oD ~—s Ye) Ke) N ise) Q No} 
rt 5 A) q | al 
JO Taquin 
WWIRISVICT ‘ws = 
jo raqmmnyy o o 
a 
ouonbaty — 4 
ToyyUaV Cn en Bc ne | oy P teil) tl atl hie! Cc ee ns ee | 
STISILOUL() COO FRR AAR Ae COO mem OCC CO AF ae eqn ee 
JFUOTAET LY Eo en On ne sear eo | ae sans me 
UdUIvYS 
0 roqmanyy La ae) st re) Ne} aN oD st Nes) Ne} Lan) GW OD sH 19 co 
Ri h | 
rn 
areasely a a = 
oO 
jo taquinyy a a o 
q 
: i) 
Aouonbaty 4 4 = 
| 
Toy ae A Be oe Be ne Do eo Leen en ec ee a 
SISLIOU() SO ROR NN OR ee | ee De 
qFUOULe LY Ee en ne ee ee ee ee Do eo Se Se ee RR ORR Ae 
U9U’YS 
jo Toquin yy Lal AN oO st New) Ne) = ie ine) SH ite) We} 
= = 
TIVISEL(T iS > 
jo Jaqmnn S oO 
oA a 
ouenbaa yy a 4 


TABLE J. 


r 


Class IV 


J. J. SIMPSON 


jo roqmunyy | 


ma Hid 


JIYJUV aad aes HO aan One 
ssuoyg | | |arnan | | [ance | | loons 
PUNTA HY AAR AARAAAy ae AHO Neo OSS 
Lecslandental 
mo GOD SH 1d © oA Hid © aA Hd © 
JO ToQuUInyy 
rH 
wIRLsRly > = = 
; Ss ein Q 
jo coquin yy oO ro) a 
eS) 
TIyqJUY SSS See (oomat (ree Leen Veer Leal poo) 
UOTE TY oes As ee Se eee ANA aa 
Ce ubS roa GI OD) SH 1 mA Hud QO stud © 
GY.) GN 6%) S ce oY) 
jo Joquinyy 
UIeISvIC, > = = 
: ay > ease 
jo zaquin yy Oo So > 
ie) 
Aouonbeay 4 Ne) =| 
Toyyuy Sessa See eS ae ORO 
SISLIOYY) | [aaa | |oorn | losers 
qUuOULe TTY Len oon Ben Aloe el Cs in ie ee ae | aeHOnO 
waIeyS 


| mao Hud 


wmeasvlqT a Eis ae 
jo toquinyy | oe) oO is 
JIYUV loealiaen ies aon iii Salil leet ae ee! ae ee 


que] Lp 


reStasoedqet i ed 


Sasser 


SS St 


ume 
48 mao Hid aN OD Hid © mM Oo Hud 6 
jo rtoquinyy 
ureaselcy = > = 
jo raquin yy 5 > ON Es ‘ 
ouenbar yy Ney o ry 


236 Contribution to a Statistical Study of the Orucifere 


Class IV. 


In this class there are 17 individuals giving six different modes of variation. 
Reduplication has taken place to such an extent as to give rise to two separate 
flowers on one pedicel. Each of the flowers was diminutive in size. Two tables 
are thus necessary for each “ flower,’ A and B: see Table J, p. 235. 


Class V. 


This class has been formed to include two very aberrant flowers showing two 
distinct variations. In both cases part of the flower has been replaced by another 
flower, in one case normal in the other slightly divergent. CIX is one of these 
in which the original flower is normal except that one of the carpels has been 
replaced by a small flower (see Table K and diagram, Plate X). CX is the other. 
In the original flower stamen 1 has been chorised and one of the chorised parts 
has given origin to a separate flower (see Table K and diagram, Plate X). 


TABLE K. 
Class V. 
o v o a Ric o os os a a 
2 ££ | od q 5 ~S = 2 &p 2a g qe 3 
S| eS es s s 5 ge | 8s| 3 | 5 
A = Bn = c y TD a es Ss) 
cap elena, Se ee hee Fe 
1 1 me 1 1 1 aa 1 
2 1 = 1 2 1 ae 1 
eC 3 1 i 1 ; CIX 3 1 0 1 
Z 4 1 1 1 B 4 0 0) 0 
5 1 feta ear 5 1 0 i 
6 1 Ik 2p Pak | 6 0) 0 0) 
; 1 OA ahs al 1 1 25 1 
1 (0) * 2 1 = 1 
; en 3 1 1 1 
a a ee 1 B 4 i 1 1 
ay 1 1 1 5 1 1 1 
1 | CX] 8 1 1 1 6 1 1 1 
A 
4 1 1 1 
| 
1 1 1 
1 1 1 
| 
6 1 1 Weed 
| 


The asterisk denotes the position of the origin of the secondary flower. 


This concludes our analysis of variations LIX to CX both as to perianth and 
andreecium, but before proceeding to the statistical part it is desirable that certain 
peculiarities should be observed and that an understanding be arrived at with 
regard to the interpretation of these. 


J. J. SIMPSON 237 


In this procedure 44 variations, namely LIX to CII, must be dealt with. 
When we study the number of parts which occur in the position of individual 
members of a whorl and then try to draw conclusions as to normality or abnormality 
of the whorl itself we find the following difficulties. 


Let us take the outer whorl of the andrcecium as an example. 


(1) If in the position normally occupied by stamen 1 there were two stamens 
and in the position normally occupied by stamen 2, no stamen occurred, then with 
regard to the whorl the total number of stamens would be two. Now this is the 
accepted normal number of stamens in the outer whorl, so that if number alone 
were considered the inference would legitimately be drawn from the table that the 
whorl was normal. But this is not so! 


Or (2) If in position number 1, one normal stamen occurred and in position 
number 2, one functioning stamen, with the filament only half the normal length, 
occurred, then the number of functioning stamens in the whorl would be two, Le. 
the accepted normal number. But again, on the basis of number alone, we should 
not be able to say whether the whorl as a whole was normal or abnormal. 


Now as this state of affairs exists not only in the whorl under consideration 
but in all the whorls of the flower, we have thought it not only advisable but 
necessary to emphasise these abnormalities as a safeguard in the interest of 
systematic statistical treatment. 


For this purpose, therefore, small diagrammatic formule have been drawn up, 
and these have been given in conjunction with the diagrams: see Plate I, figs. 
11, 12. 


We have already defined the positions of the various parts of the androecium 
but have hitherto refrained from naming the different constituents of the perianth. 


In the cases under consideration, however, it is necessary to do so, and Plate I, 
fig. 11 illustrates how these are definitely determined. 


The two outer sepals are named A and B (see Fig. 11). A corresponds in 
position to stamens 3.4 and B to stamens 5.6. OC and D are the two inner 
sepals ; C’ corresponds in position to stamen 1 and D to the stamen in position 2. 
The petals are named A’, B’, C’ and D’ and lie respectively between sepals A and C, 
Band D, A and D, and B and C. 


The actual order of all the parts is summed up in Plate I, fig. 12 (1—14). 


Biometrika x 31 


Contribution to a Statistical Study of the Crucifere 
TABLE L. Class V. 


238 


| ouenberq 


WIeIsVlCy 
jo qoqruny 


LXI 


LXX 


[eulIouqy 410 
[BULIO NY ‘PIO AK, 


[OU M, 
ul SA9Q (AO TAT 


iS K 
gx <= 
ie] st 


UL taquin yy 


| uorqtsod yore 


Sse KH SaaS So SN 


Len en oe ee ll ee ee oe CS | 


Eee | Seater oS a) a a he aL = mee on enna | 


Se se Ree On eR NN 


UL SaOq UOT 


Aonenbet y oa 
| f 
7 
UWIRLSVICT bd = 
jo qaquim yy 4 a 
4 
|_Tetusoaqy 0 = SS = ~ x 
| [BULION TIO A, 
TLOCTEAN = an © 


worqisod yore 
UL TaquanN 


Aouenbaatyy 


Se oe Be ee oe oe eS eS | 


TURISVI] 
jo qequiuy 


N 
“ 
loot 
| 
‘| 
i S me } 
le sR 
a nN oD Nn wt 


ee ce ee oe oe oe ee eC ey | 


6 


ee ee ee ee ee eS ey | 


| | 


[euLtouq Vy 10 
[VULI0 NY ‘[LOT] A. 


N 
N 


TEOMA 
Ul STOq THOTT 


n | 
TACT | 

2S ~ So | 
< ase aes = 
a nN + a =H | 


uorqtsod yao 
UL faq un jy 


[ZOU 
| jo raquioyy 


Se ee ee | 


Se ee AR eR NN ANS 


[OU M 
jo zaqumyy 


II 


IV 


[i ce A oe oe Be ee eS 


WQ OD HSA GA 


Ese a a SES HO 
FSO SROs Se 
—| —! Ky 
a FR & F&F & 


239 


J. J. Srmpson 


TABLE L—(continued). 


Souanbeat yy 4 | =) = 4 


TURAISVICT = pes ee yw 
jo gaqmany NOON TAX NEDEXAXGT EXSXOXGETI 
[eultouqy 10 x = i ; . = pe es ae, 7 Py ‘ | : » > - 
[VULIO N ‘TOT, = =i q | q | al Po) 1 NX | io | LT NX xq qq N x NX q Ss 
Ou AN es] a fe a oD | a fe] a o a = nN oD a 10 | 4 a rex 4 ra 


UL Seq WoT 


uotyisod yore 
UT TOquIn 


a 


Se ae Ss OO He HN | SS ese ROO HO NN HO Fa FOR HH HO MN sO Fe On OR FRO Ae 


Aouonbaty | a = = 12 
is CARNE es —__~ =e Ea = = Z 
UIeLSPICT DA LA 
jo aaqrany A A EN XEXGT IXXXT 
J a | 
. ab R iM - 
| 
[euLIouq Fy 10 , n _ ‘ » is K R N S 
peumon jou | 4 Se ~ ras ~ x = x 41 x ~ OR 
a ah ge 
see — a = | | 
| me rae | 
TOU M. s AG er m 
an ) > aA) N yD A ies io) © o) a] > G > > 
UL S1oq Wd PT : r 2 = re ¢ = | ~ = F = Ei 
i aaa at” = i es 
uortsod yore 
as tol GJ ie Se iil KO | rt rst St SO GUD OOO. | SO Fe FR Or Fee Ae SHO FR OCOORF BO NN 


UT TOQUIN NT 


Aouonboay = | 4 ™” a 
5 : ~ ee -| = = : e _ c = 
WRIsViI ~ ae 
jo zaquiny ~“ | INTEXEXET TENCX SCAT XxXXT 
= | — 
[eulouq Vy 0 ‘ re pa. si = K fs r 
remmon qaoyyy| 4 as a x i SO nee x ~ se, Sars ~ se aS 
TOT AA, % an ee m= 
UL SIOQ UII] 5 eS ve aa ~ 7a N = an Pu a a oD 4 + 4 er) al 4 Ney 
uortsod yoro 
Quts SSO OR RRO ORO AN lil Lot ee) hh (Re -—-O Sm FR On FO NN sO Fn RR On FO MN 


UI JaquIn 


[LOT AA. aS. See HO Se HO ee ee +H ae + 
Jo zaqtueyy AIS SSP Silas ee WR OQ WRSQA WAN EO =~ SA FROR RIA Resse SR CIQ Sasa oN fis 
TOU. = Se a 

sas Sg Fa ba See NG OS A) eS A ee he ieee he on St oii age 


31 


v 


fere 


y of the Crue 


Contribution to a Statistical Studi 


° 


TABLE L—(continued). 


240 


UL S1Aq WATT 


Aouionbaay = 8 a o 
eee AXXXT HIAXXX'1 S 5 
jo toquin ai A 
| eee a = x = 8 x x be x S} oS Pe = x hg 
TOU AL ~ re an a n a Se = — ON ve) ds 


| 
UL Taq uIN N 


uorytsod yore | 


NO Ae Few COO NM 


Se ae OOR Se SBR NN 


OO8 Ss Ol Oi Oe eGliGl 


| 


AN Aw BANS ANN OW 


UL SLoqure fT 


Aouenbe.lyy a 4 o = 

} ee | : —! 

WRIST it paste S) = 

Jo zaquan x. AIXXXT TIAXXX'T nA g 

d 

yeuLtouqy 10 es = seal ae ie we ee is ta; 
came drem |) OS x = Ste. oa ‘ = { = q saad ~ & ‘ Sees 
TLOmT AN ° ~t a a tH a - a Ney oD 4 a) a 12 oD 12 


uorqtsod yore 
ul Jaquinyy 


IO) tls! eS Se INS NIN 


Le re ee ee CS | 


a0 Fe FONN FAN HO 


Tc eee 


AN FAR BRN NT NM 


ouenbad yy 4 4 Ne) 4 
UIBASRI(T owe wXY XIXXXT : 
46 sequin TIIXXXT IAXXXT DOO S 
s 
ean 2 ee [ss ss oo ee 
IOUA 
TOU M, o <H a o nN ° = a a a nN || ca ~ o9 a +H 
UI Sdoquey | 
| uorztsod yore Sieiis: an HH OO HHO0O OO SNS) eS Sate het lie ep Seno Amb abr R Tey“ leNten 
ur taquainn 
[LOU AA ae oe co) a Oe Lae <HO eae +O 
Jo toquiayy SS SSNS e316 MAS Set OS eee SS OR PRSQ WS coig WR OM WHSOQ HA 2S 
TOU = is = S = 
| HOU - Ee se eee ee | eee eee ase Se 


JO Toquin Ny 


241 


J. J. SIMPSON 


TABLE L—(continued). 


Aouon boty - o 


= 
TRIS] > 
jo gaquiny 2) S 
~ 
= —— ——— — | 
- | 
eutou JO 
: eS ~ 5 x A = x ~ =z | 
[CULIO N “PLOT AA, > 
Toy A 
eu tH a Ve) st oe) fe] st Ne) a We) 


UL Staq Ute JY 
wortsod yore 
url faqmunN 
CE EE ee 


AN FA ANAK ANNAN WH 


Aouonba.ty oq 4 
[aml h 
UTRISVICT > “4 
aa © S 
oO qaquInN 
J aN A a 
RULIOUG W LO ‘S 
U Lv Ssh Ee x Yo oS Sl Ss a eS se| 


[BULION “PLOT AK 


TOUAUN Uh ee = as | 
UL SIOQUOyY z é | 


9 
5) 
4 
6 


N ioe) 


uorisod yora | 
Ur JaquuN 
eee SS... 


BN AN ANRwe Fe OM 


Aouonbeady 


ureLseiq, 
jo raquin yy 


Aouonbedy = a 
WIBLSRL(T is was 
jo raquunuy A OX 
[eULLoUq YW «0 . = ke 
‘eumoy qroym | “ S 7 hem aS se x <a 
[OGM : = 
UL SIOquayy ae i = wa So Bch SS 2 Sh = Sh 
| uortsod yowo | eerie a ee eh Rie cok asi ane era 
|_ ur equim 
| touAA eee 1 Saints <0 
| Jo zaqmeyy SEES HONS NSIS ISVS cag SS SS re oars 
HOUMA 4 = > 4 = 
jo coquan yy Bi a = mF She 5 eae 


[euadouqy 10 
[BUOY “ETOU AA. 


[PROG LANN 
UL SIOq WOT 
uortsod yore 
UL TaquinN 


Aouonbaay 


N 
A 
A 
A 
A 


i s (=) lan) pro) 


See ANN CDOO0O FAN MAN 


UIRISVL(T 
jo zoquin 


CI 


[euLtouq Fy 10 
[BUI N AOU A 


ROTA 
UL SOQ UIOTT 


uortsod ToRe 
UL JoquInN 


ae NAM ANNA NN TS 


Ou. ae Sones Ke) 
jo daquieyy BS SEO Seeks 
TZOUIM [Psat ts = eo iss 
= = 


JO TOQuan Ny 


| 


242 Contribution to a Statistical Study of the Crucifere 


IV. STATISTICAL. 


The analysis which we have given of 1813 flowers is sufficient to show that the 
idea of a definite fixed number of sepals in the calyx, of petals in the corolla, of 
stamens in the androecium or of carpels in the gyneecium of cruciferous plants is 
not upheld by an examination of a large number of flowers of this species. In 
less than 1 per cent. of the flowers examined there was an increase* or decrease 
in the number of sepals in the calyx; in less than 1 per cent. there was also an 
increase or decrease in the number of petals in the corolla, but in 2 per cent. there 
was an increase in the number of stamens in the andreecium, while in 22 per cent. 
there was a decrease in the number. 


Since then the number of sepals, petals and stamens is not absolutely fixed for 
any of the organs it becomes necessary now to consider whether the number of 
members in one organ is related to the number in the others. 


As has already been pointed out we have not only to consider organs as a 
whole, but, in the case of the calyx and the andrcecium, the constituents of these 
organs, owing to the fact that these organs are each divided into two separate 
whorls which are inserted at different levels and are placed in directions at right 
angles to one another. 


Further, a special study has been made of the various positions in androecium 
to ascertain to what extent bilateral symmetry may be regarded as an inherent 
character of the flower under consideration. 


By this means also it seems that some defimite information might be obtained 
with regard to the perplexing and, at present, hypothetical theory of chorisis, the 
reasons for the existence of which have been summarised on p. 219. 


The statistical part has been divided into two sections: 
(1) astudy of the Means and Standard Deviations, and 
(2) a study of the Correlation Coefficients. 


1. Study of the Means and Standard Deviations. — 


Although it is obvious from the analysis of the data under consideration that 
the numbers given for the botanical floral formula, namely, Calyx—4, Corolla—4, 
Stamens—6 and Gynzcium—2, are the nearest integers, it is not at all certain 
from a mere inspection of the tables whether the actual means deviate from this 
number in the direction of excess or deficiency. 


* Where chorisis of a sepal or petal has resulted in two or more distinct individuals we have regarded 
each of these as a distinct sepal or petal in recording the numbers. This method is natural however 
inasmuch as it is the only means by which we may possibly trace reduplication of parts. 


J. J. SIMPSON 243 


Consequently the mean and standard deviation for each of the organs and its 
constituents have been calculated and these are given in the following table: 


TABLE M. 
Means and Standard Deviations of the Number of the Organs and their 
Jonstituents. 
es : ee es — ed oo 
; : | Coefficient 
Organ Constituent Member ae Sh cn of | 
| ae Variation 
Calyx ne. fe — -- 3°9796 vaya 67415 
i Outer whorl — 1°9757 "1979 10°020 
5 Inner whorl — 2°0039 1304 6°507 
Corolla... igs — _ 3°9520 *3523 8914 
Andrecium ... — -- | 5°8092 "7567 13025 
Ss Outer whorl — | 199570 | :2704 13°817 
“ = “Stamenl | 9840-1602 «=| «16-280 
Fe —_ —Stamen 2 ‘9713 | ‘1915 19°715 | 
Fs Inner whorl | — | Sane 76588 | 17077 | 
: _ | Stamens 3.4 | 1:9950 ‘2728 | 13674 
me == | Stamens 5.6 | 1°8627 | ‘4863 | 26°107 


(1) The most obvious result which is revealed by these constants is the fact 
that in all cases (except the inner whorl of calyx) the actual mean of the organs 
is less than the recognised typical number, thus : 


The mean number for the calyx is 3979 instead of 4. 

The mean number for the corolla is 3°952 instead of 4, 

The mean number for the andrecium is 5°809 instead of 6. 

(2) The inner whorl of the calyx shows the smallest departure from the 
accepted typical number, namely, 2:004 instead of 2. 


Let us, however, test how far the differences in the character of the analogous 
parts are significant by ascertaining the Probable Error of the difference of the 
means of the characters. 


TABLE N (1). 
I. Constituents of the One 
- 7~— 
| Maan. stars 
Constituent |} Number — Deviation 
Outer whorl | 19757 | +1979 
Inner whorl | 2:0039 "1304 
| 


The difference here is D='02813 and the a eratis error of the difference 


ie = 0037; thus the value p= 76. The difference is therefore clearly 
He D 


m 


significant. 


244 Contribution to a Statistical Study of the Crucifere 
It is worthy of notice that the outer whorl of the calyx is more variable than 
the inner whorl and that it possesses on an average fewer sepals. 
TABLE N (2). 
Il. Members of the Outer Whorl of the 


Andrecium. 


Mean Standard 
Number Deviation 


Member 


‘9718 “1915 


| 
| 
| 
Position 1 
| Position 2 


"9840 "1602 | 


The difference here is D= ‘01268 and Ep =:00391; the value —— 
is Di 


Thus when the two positions of the outer whorl of the andrcecium are taken 
into consideration, a probably significant difference is found between the means of 
the distribution of the parts of this whorl. Now in position number 1 there is 
a greater approach to the accepted type owing to the fact that when the com- 
ponent of one of the positions of the outer whorl was found to depart from the 
accepted type, the other position was selected as the starting point for the 
orientation of the flower and was called position number 1. It is all the more 
noteworthy that the deviation for position 2 is not in the direction of greater but 
of lesser frequency and the variability of position 2 is greater. We have thus 
again a reduction in the value of the type with greater variability. 


TABLE N (3). 
III. Members of the Inner Whort of the 


Andrecium. 


Mem Mean Standard 
Cen Number Deviation 


Position 3.4 ... | 1°9950 2728 | 
| Position 5.6 ... | 1°8627 -4863 


Here the ratio eee is nearly 15 and therefore there is quite a significant 


= (m,—™M2) 


difference between the means of the distributions of the two members of the inner 
whorl of the andreecium. In both cases the tendency is towards a suppression of 
functioning stamens rather than an increase, together with greater variability 
in the case where the reduction from the accepted type is more marked. 

This difference in variability is, in the main, real and is not due to the arbitrary 
selection of the 3.4 position. This will be evident from a study of Table XIX. 
It will there be seen that there were 1754 cases where two stamens occurred in 


J. J. Srmpson 245 


one of the positions of the inner whorl of the andreecium. This is the number 
in the accepted type, and thus there is no variability. What is the nature of the 
distribution of the stamens in the other position (Table XVIII)? It is as follows: 


1 | 2 | 3 4 | 


| 
287 | 1436 | 26 | 5 | 1754 


The mean for this array is 18569 stamens, with a variability of ‘4116. Thus 
when there is no variability in one position of the inner whorl of the androecium 
there is a large variability in the other position. 


Similarly we find the following distribution for position 2 in the outer whorl 
of the andreecium when position 1 is of the accepted type, 1.e. shows no variability. 


57 1708 1 1766 


The mean for this array is ‘9683 with a variability of 1784. Again therefore, 
when there is no variability in position 1, there is a reduction of type in position 2 
with great variability. 


2. Study of the Correlation Coefficients. 


For the purposes of this study a number of correlation tables have been 
prepared and as the results of these will have to be considered under different 
groupings it seems advisable to tabulate them, and insert them consecutively. 
The system which has been adopted to facilitate reference is to commence with 
the outer whorl of the calyx and consider all cts relations with the other whorls of 
the flower passing from the outside inwards; following this comes the inner whorl 
of the calyx and its relations with the other constituents of the flower from the 
outside inwards and so on. 


The following table shows the characters studied and the correlation coefficients 
found. 


In order to make the comparison of the various correlations as complete as 
possible it will be necessary to consider each constituent or organ with all the 
other constituents or organs and to avoid overlapping as far as possible. The 
most natural method would be to commence either with the outermost constituent, 
namely, the outer whorl of the calyx, or with the innermost constituent, namely, 
the inner whorl of the andreecium. For reasons of a morphological character, which 
will be seen later, the inner whorl of the andrceecium has been chosen as the 
starting point. : 


Biometrika x 32 


246 Contribution to a Statistical Study of the Crucifere 


TABLE O. 


Correlation Coefficients between the Number of Various Organs and their Constituents. 


Table Yr. 
The outer whorl of the calyx and the inner whorl of the calyx... is xa I 1957 
The outer whorl of the calyx and the corolla ae w II "7275 
The outer whorl of the calyx and the outer whorl of the andrcecium ae a Ill “5886 
The outer whorl of the calyx and the inner whorl of the andreecium is ato IV 2613 
The outer whorl of the calyx and the andrecium ... She Fae ee ids Vv “4371 
The inner whorl of the calyx and the corolla Ae via a VI 2476 
The inner whorl of the calyx and the outer whorl of the andrecium ase io VII 3229 
The inner whorl] of the calyx and the inner whorl of the androecium wats moa). WELL +3905 
The inner whorl of the calyx and the andreecium ... oh ee wee Ben IX “4592 
The calyx and the corolla 2 aes ae see noe eA x 6926 
The calyx and the outer whorl of the andreecium ne re es Bsc XI "6245 
The calyx and the inner whorl of the andreecium ... a ute Me, ae XII -4014 
The calyx and the andrecium ... nes ore ae a Boe || 2-SILILIL 5721 
The corolla and the outer whorl of the ‘andrescium ante ame ES ae a XIV 4762 
The corolla and the inner whorl of the andrcecium ... ats Ape are hee XV 1773 
The corolla and the andreecium ae XVI 3174 
The outer whorl of the andreecium and the i inner whorl of the androecium: XVII "1984 


The inner whorl of the andreecium, Posie 3.4 and the inner whorl of the XVIIT| 4305 
andreecium, position 5.6... a 


The outer whorl of the calyx and the inner whorl of the andrecium, position 32 oxen 4646 
The outer whorl of the calyx and the inner whorl of the andreecium, position 5.6 XX 2134 
The inner whorl of the calyx and the inner whorl of the andreecium, position 3.4 | XXI 4634 
The inner whorl of the calyx and the inner whorl of the andreecium, position 5.6 | XXII | 2519 
The corolla and the inner whorl of the andrecium 3.4... ses is ... | XXIIT | +2558 
The corolla and the inner whorl of the androecium 5.6 XXIV | 0903 
The outer whorl of the andrecium and the inner whorl of the andreecium, XXV | -2661 


position 3.4 
| The outer whorl of the ‘andreecium and the inner » whorl of the andreecium, tXXVI +1539 
position 5.6 soc 


(a) The inner whorl of the andrecium. 

From the standpoint of the systematic botanist the most anomalous constituent 
of the cruciferous flower is the inner whorl of the andrcecium, inasmuch as in each 
of the positions where one stamen would naturally be expected, the presence of 
two is regarded as typical. It has been explained in a previous section that 
botanists now usually regard this anomaly as having arisen by collateral chorisis 
from what was originally a single stamen in ancestral forms. For the sake of 
conciseness and in order to avoid unnecessary repetition the following abbreviations 


have been used in Tables P—X. 


O. W. Ca. = Outer whorl of the calyx. 
I. W. Ca. = Inner whorl of the calyx. 


Ca. = Calyx. 

Co. = Corolla. 

O.W. A. = Outer whorl of the andrcecium. 
IW. A. = Inner whorl of the andreecium. 


A. = Andreecium. 


= 


J. J. Simpson 247 
The following Table, P, gives the correlation coefficients between the I.W. A. 


and the other constituents or organs of the flower in order of position. 


TABLE P. 
I. W. A. and the other Constituents. 


Constituent or Organ Table Correlation 
OMWaiCan... As IV 2613 

Hee Wit Cari ne Wang | "3905 

Ca. site ae XII 4014 

Co. Se oe XV | ‘17738 | 
O. W. A. ... ves XVII 1984 | 


The highest correlation between the inner whorl of the andrcecium and the 
other constituents or organs is that with the calyx; next in order come the inner 
whorl of the calyx, the outer whorl of the calyx, the outer whorl of the andreecium, 
and lastly the corolla. In other words, we should be better able to predict the 
number of stamens in the inner whorl of the andreecium from the number of 
members in the calyx than from the number of members in any other constituent 
or organ. 


(b) Relations between the organs themselves. 


Having thus discussed the inner whorl of the androecium with the other organs 
and constituents it might lead to some useful result if we proceed to determine 
the “organic correlation” existing between the various organs themselves. In 
this connection we have to consider the calyx, the corolla and the andreecium, and 
for this purpose the correlation Tables X, XIII and XVI have been prepared. 
The character which has been selected for this study is the number of members 
in each organ. 


The following Table (Q) shows the results obtained : 
TABLE Q. 


Correlation Coefficients between 


Ca, and Co. ... | "6926 
Ca.and\ A.” ... | “5721 


| 
| Covand As... || “3174 
| ee | 
(1) The calyx and corolla are much more highly correlated to one another thau 
is either of these with the andreecium. In other words, the two protective organs 
of the perianth are more highly correlated to one another than is either protective 
organ with the male reproductive organ. It is further evident that (2) the calyx 
is much more highly correlated to both the corolla and the andreecium than are the 
two last named to one another. From (1)it may be concluded that, on an average, 


32—2 


248 = Contribution to a Statistical Study of the Crucifere 


an increase or decrease from the accepted typical number, namely four, of petals in 
the corolla is accompanied by an increase or decrease in the number of sepals 
in the calyx; while from (2) an increase or decrease in the number of stamens in 
the andreecium will be accompanied, on an average, by a greater increase or decrease 
in the number of sepals than in the number of petals. 


(c) Relations between the constituents of organs. 


The constituents of (1) the calyx and (2) the andrcecium will now be con- 
sidered. ; 

(1) Calyx. The outer and inner whorls of this organ are inserted at different 
levels and have a decussate arrangement, so that, although the organ as a whole 
is protective in function, the two whorls actually help to enclose the flower at 
right angles to one another. The correlation between these two whorls is an 
extremely low one, namely, 1957 (Table I), in other words, an increase or decrease 
in the number of sepals in either of the whorls of the calyx is associated only in 
a very small degree with an increase or decrease in the number of sepals in the 
other whorl. Or again it may be expressed thus, the two whorls of the calyx 
vary to a great extent independently of one another. This statement should be 
taken in conjunction with that made on p. 244 with regard to their Means and 
Variabilities and should also be borne in mind when the correlation between 
these two constituents and the other parts of the flower are discussed below 


(see Tables R and 8). 


(2) Andreciwm. This organ is also composed of two whorls, an outer and 
an inner inserted at different levels. Its function is of course reproductive. 
The correlation between the two constituents is very low, namely, ‘1984 (see 
Table XVII), and is almost the same as that between the two whorls of the calyx. 
The inner whorl of the andrcecium shows greater variability than the outer whorl 
and tends to vary independently of this latter constituent, just as in the case of 
the two whorls of the calyx. 


Having thus considered the organs per se, let us now compare the correlations 
between each individual constituent or organ and all the other constituents or 


TABLE R. 
(d) Correlation Coefficients between the Outer Whorl 
of the Calyx and 
2nd Component Table 1, 
UI Ne OP IL "1957 
Co ner 11 7275 
O. W. A Ill 5886 
Te Wee A: IV 2613 
A Vv 4371 


J. J. SIMPSON 249 


organs. For this purpose it will be necessary to tabulate the results in series and 
consequently it might be well to start with the outermost constituent of the flower, 
namely, the outer whorl of the calyx, and tabulate the correlation coefficients 
passing inwards to the andreecium. The inner whorl of the calyx will next be 
taken in relation to the other constituents and so on. 


From the above table it will be seen that the outer whorl of the calyx is most 
highly correlated with the corolla; it is also highly correlated with the outer whorl 
of the andrcecium but much less so with the inner whorl of the andrcecium. 


TABLE §&. 


(e) Correlation Coefficients between the Inner Whorl 
of the Calyx and 
2nd Component | Table rs 
COs. VI 2476 
O. W. A. VII "3229 
Wie Ae VItl “3905 
TNS IX 4592 


The low correlation between the inner whorl of the calyx and the corolla is due 
to the close adherence of the former to type, that is, there is very small variability. 
TABLE T. 
Correlation Coefficients between the Calyx and 


(f) 


2nd Component Table rs 

Come. x "6926 
O. W. A XI 6245 
IW. A XII 4014 
AS sts XIII “5721 


There is a higher degree of correlation between the two organs of the perianth 
than between the calyx and the andreecium. The high correlation between the 
calyx and the outer whorl of the andreecium is mainly due to the high value 
obtained for the correlation between the outer whorl of the calyx and the outer 
whorl of the andreecium. 


TABLE U. 
(g) Correlation Coefficients between the Corolla and 
2nd Component Table is 
OR Wards XIV "4762 
I. W. A. XV 17738 
Aes XVI 3174 


250 Contribution to a Statistical Study of the Crucifere 


The corolla is much more highly correlated with the outer whorl than with 
the inner whorl of the andrcoecium, and the correlation between the corolla and the 
andreecium as a whole is not very great. 


A comparison of Tables T and U shows that there is a much greater correlation 
between the calyx and the andreecium and its two whorls, than between the corolla 
and the same constituents. 


So far we have considered the relationships between the different parts of the 
flower from the outside inwards, but when we examine these relationships, taking 
the inner whorls as our starting point, some new aspects of* the problem become 
manifest and, as these have been of great value in the interpretation of the results, 
it has been considered advisable to tabulate them thus: 


TABLE V. 


(h) Correlation Coefficients between the Inner Whorl 
of the Andrecium and 


2nd Component Table is 
Can ic ase XII “4014 
TW C aremene VIII “3905 
OMWis Cannes IV 2613 
Oo AW tcAln tee XVII 1984 
Co. =: mate XV ‘1773 
TABLE W. 
(2) Correlation Coefficients between the Outer Whorl 
of the Andrecium and 
| 2nd Component | Table i: 
| 
Carex: 586 XI 6245 
[OW Cosme, Ala 5886 
Cones. bee XIV “4762 
[iti Rg Cay ee VII 3229 


A comparison of Tables V and W shows that the correlations between the 
outer whorl of the andrcecium and the other components are higher than for the 
inner whorl of the andrcecium, except in the case of the inner whorl of the 
calyx. 


J. J. SIMPSON 251 


TABLE X. 
()) Correlation Coefficients between the 
Andrecium and 
2nd Component Table Ts 
Ob aor oe XIII 5721 
Te WieCare ere IX "4592 
OW Can ce- V 4371 
Co. ... ie XVI 3174 


This table shows that when the androecium is considered as a whole it 1s mast 
highly correlated with the calyx and least correlated with the corolla. 


V. MORPHOLOGICAL SIGNIFICANCE OF THE STATISTICAL RESULTS. 


It is quite clear from the tabulated results that there is a definite departure 
from the usually accepted cruciferous structure in a very large number of the 
flowers of Lepidium Draba which have been examined for this study. This does 
not obtain merely in any one organ or constituent but in all the organs and 
constituents, although not to the same degree in each. 


The statistical results will now be examined from the standpoint of the 
botanist in order (a) to note their morphological or genetic significance and 
(b) in order to see whether these figures throw any light on the evolution of this 
cruciferous plant. 


It is almost axiomatic to state that the “purpose” of a flower is a purely 
reproductive one and that therefore its existence is justified only in so far as it 
serves to reproduce its kind. But not all the parts of a flower are solely repro- 
ductive in function. Each individual consists of two parts, (1) Reproductive, 
(2) Protective. (1) The reproductive organs are the gynecium (2?) and the 
andreecium (¥), while (2) The protective organs (perianth) are the corolla and 
the calyx. 


One of the organs of the perianth, namely the corolla, is still further specialised. 
The calyx consists of four sepals, green in colour, whose sole function is to protect 
the flower when in the bud, and in many cases these are reflexed immediately after 
the flower has opened up, and are of no further importance to it. On the other 
hand the petals though essentially sepal-like in structure, in this as in the great 
majority of flowers, are not green but of some other colour. In the species under 
consideration they are white. Now although the petals are of great importance 
in protecting the reproductive organs while in the bud their utility does not cease 


252 Contribution to a Statistical Study of the Cruciferae 


when the flower opens but, along with small nectaries at the base of the stamens, 
serve as an attraction for insects whose visits are essential for cross-fertilisation. 


The reproductive organs of what is regarded as the typical cruciferous flower 
consist of (1) the gynaecium which is composed of two carpels and (2) the andreecium 
which is composed of six stamens. The stamens are delicate structures and do not 
hold an isolated position in the flower. When in the bud and immature they are 
subject to external influences, for example, (1) they might be shrivelled up by the 
heat of the sun, (2) they might be blasted by rain or wind or (3) they might be 
attacked by herbivorous insects, so that the protective perianth plays an important 
part in flower economics. Now what does an increase in the number of stamens 
imply? It is obvious that if the number of stamens is increased the total volume 
occupied by the reproductive organs is increased and consequently a tax is put 
upon the protective organs if they are to fulfil their function adequately. If the 
perianth does not respond to this tax from space considerations, the reproductive 
organs stand a small chance of ever fulfilling their function, so that one would 
naturally expect that variation of some kind in the perianth would follow variation 
in the reproductive organs. 


Another important point which must never be lost sight of when interpreting 
the statistical results is the symmetry of the cruciferous flower. The calyx consists 
of two whorls each with two sepals; the corolla of one whorl of four petals and the 
andrceecium of two whorls of stamens, the outer having two members and the inner 
four members (see Plate I, fig. 7). Consequently a cruciferous flower is bilaterally 
symmetrical only on that vertical plane which passes through the division wall of 
the carpels, between each of the pairs of stamens in the inner whorl, between two 
petals on either side and through the middle of the outer pair of sepals, This 
plane may be referred to as the “plane of symmetrical division.” Owing to the 
fact that the corolla consists of only one whorl, the outer whorl of the calyx 
corresponds in position to the inner whorl of the andrcecium, and the inner whorl 
of the calyx to the outer whorl of the andreecium. 


From a study of the Means and Standard Deviations of the various organs and 
constituents we arrive at the following conclusions : 


Calyx. (1) The greatest approach to constancy in number in the whole 
flower is in the inner whorl of the calyx. 


(2) There is a significant difference between the means of the two whorls. 


(3) There is much greater variability in the outer than in the inner whorl of 
the calyx and on an average it possesses fewer sepals. 


(4) There is a tendency towards a reduction from type in the number of 
sepals in the calyx. 


Corolla. (5) There is a tendency towards a reduction from the accepted 
typical number in the number of petals in the corolla. 


J. J. Srpson 253 


Andrecium. (6) There is a significant difference between the means of the 
distributions of 
(a) the members of the two whorls of the androecium, 
(b) the members of the two positions in the inner whorl, 


and (c) the members of the two positions in the outer whorl. 


(7) From whatever axis we view the andreecium as an organ it is distinctly 
asymmetrical in the distribution of its functioning stamens. 


(8) There is much greater variability in the inner whorl than in the outer 
whorl of the andrceecium. 


(9) In both positions in the inner whorl of the andrcecium there is a tendency 
towards a reduction from the accepted typical number of stamens and in the 
position where this is most marked there is the greatest variability. 


The interpretation of these results is not at first sight very evident. 


Why should there be a tendency towards a reduction in the number of 
members in the different organs of the flower and why should this tendency 
be most marked in the inner whorl of the andreecium? As has already been 
pointed out all the flowers examined were taken from a single plant which gave 
rise to new stems by means of buds on the roots. May this tendency to reduction 
in the parts of the flower whose function is sexual reproduction not be an expression 
of a tendency towards an elimination of sexual in favour of vegetative reproduction ? 
Another phenomenon which lends support to this hypothesis is the fact that in 


this plant the percentage of “ pods” which attain maturity is extremely small. 


Whether there is or is not a tendency towards vegetative reproduction, may 
we not also have here a harking back towards an ancestral form in which the 
number was less than the at present accepted typical number? In fact one would 
expect that if the present constitution of the inner whorl of the andreecium had 
been most recent in development, reversion would first take place in it, and 
conversely one might reasonably conclude that since this whorl shows greatest 
variability, and most marked tendency to reduction in the number of members, it 
is more than probable that its present constitution was arrived at by an increase in 
number from a more primitive type. 


Let us now examine the deductions made from a study of the correlation 
coefficients and see if they have any morphological interpretation. 


(1) The calyx and corolla are more highly correlated with one another than is 
either of these with the andrcecium. 


(2) The calyx is more highly correlated with the andreecium than is the corolla. 
In other words, the two protective parts are more intimately associated in increase 
or decrease with one another than is either of these with the male reproductive 
organs, and further the calyx which is solely protective in function is more 

Biometrika x 33 


254 Contribution to a Statistical Study of the Cruciferae 


intimately correlated with the male reproductive organs than is the corolla which 
serves as an attraction for insects as well as a protective covering of the bud. 


(3) The two whorls of the calyx are not highly correlated, i.e. they vary inde- 
pendently of one another. 


(4) The two whorls of the andrcecium also are not highly correlated. Morpho- 
logically this means that when there are two constituents in one organ, each having 
the same function, they may vary independently of one another, so that although 
an increase or decrease in the number in either may be correlated with an increase 
or decrease in the number in any other constituent of the flower, the same does 
not hold true with regard to the two constituents. 


(5) The outer whorl of the calyx is most highly correlated with the corolla, 
next with the outer whorl of the andreecium and lastly with the inner whorl of 
the andrecium. The reason why the outer whorl of the calyx is more highly 
correlated with the outer whorl than with the inner whorl of the androecium is 
not at first sight very evident, but may be explained on the basis of its protective 
power. The members of the outer whorl of the andrcecium lie in a plane parallel 
to that of the outer whorl of the calyx, and are much more widely separated in 
this plane than are the members of the inner whorl of the andreecium. Con- 
sequently any increase in the number of stamens in the outer whorl would involve 
a much greater increase in volume within the flower than a corresponding increase 
in the number of stamens in the inner whorl. Thus we are not surprised to find 
that such an increase in the outer whorl of the androecium is more intimately 
associated with an increase in the outer whorl of the calyx than a corresponding 
increase in the inner whorl of the andrcecium would be. 


(6) There is very low variability in the inner whorl of the calyx and it is 
almost equally correlated to the two whorls of the andreecium. The morphological 
explanation of these facts follows as a corollary to that given above. 


(7) The calyx is much more highly correlated with the andrcoecium as a whole 
and with its two whorls than is the corolla. 


As we have already said the calyx is the predominantly protective organ and 
consequently this higher correlation has a physical basis. The corolla being partly 
attractive does not enter so closely into space economics. 


(8) The outer whorl of the andrcecium is more highly correlated with the 
other components of the flower than is the inner whorl of the andreecium. This 
again follows on the basis of space considerations. Any increase in the number 
of members in the inner whorl of the androecium does not involve so radical a 
change in the volume of the flower as does a corresponding increase in the outer 
whorl of the andrceecium. 


J. J. SIMPSON 255 


VI. VARIATION IN THE GYNACIUM. 


So far we have not considered the gynzcium on account of the small number 
of variations which occur in that organ and from the fact that these do not lend 
themselves to statistical treatment. 


The gynecium consists typically of two carpels which are flattened in a vertical 
plane parallel to those containing the pairs of stamens in the inner whorl of the 
andreecium. The thin partition wall separating the two carpels therefore stands 
at right angles to this plane. 


Now when we examine the different types of variations in the structure and 
number of the carpels we find the following: (1) a single carpel, (2) two carpels 
(typical), (3) three carpels, (4) four carpels, (5) two sets of two carpels within 
a single perianth, (6) two sets of two carpels within separate perianths but on one 


pedicel. 
Let us now proceed to examine each of these in some detail. 
(1) The gyneecium consists of a single carpel (see Figs. LXXXVIIT—XCII). 


In all these cases, except LXX XVII, as will be at once seen by reference to the 
figures, the suppression of a carpel is accompanied by the suppression of some of 
the members of nearly all the other organs thus: 


In LXXXVIII two petals are absent and one stamen is aborted. 
In LXXXIX one sepal, two petals and two stamens are absent. 
In XC, XCI and XCII all the organs are deficient in members. 


A noteworthy phenomenon in this respect also is that the suppression of 
members which accompanies the suppression of a carpel is usually in the vertical 
plane which passes through the plane of separation of the carpels. 


(2) The gynecium consists of two carpels. 


This is the accepted typical structure and the statistical study deals with these 
in detail. 


(3) The gynecium consists of three carpels (see Figs. XCIII and CII). 


When three carpels occur in the gynecium they are never found co-laterally, 
Le. the additional carpel is never found with its origin at the side of a carpel, but 
always arising from the plane of separation, which is in the plane of greatest 
variability. 


(4) The gynecium consists of fowr carpels (see Fig. Cl). 


Just as in the previous case the increase in the number of carpels takes place 
in the plane of separation of the carpels—one on either side, so that a cruciate 
structure is found. A reference to Fig. CI will make this clear. In both of these 
groups it will be evident that an increase in the female reproductive organs is 

33—2 


256 Contribution to a Statistical Study of the Cruciferae 


associated not only with an increase in the male reproductive organs but also in 
an increase in the protective organs or perianth. 


(5) The gynecium consists of two sets of carpels within a single perianth. 


This is rather an anomalous group but is extremely interesting inasmuch as it 
contains a series of annectant forms linking group 2 to group 6. What we actually 
have here is a complete reduplication of the reproductive organs encased within 
a single series of protective organs. In some of the flowers examined with this 
structure it was rather difficult to determine the orientation owing to a torsion 
of the thalamus, but in the types figured on Plates IX and X (Fig. XCIV and 
Figs. XCV et seq.) the mode of origin of these is quite evident. Several 
important observations on these forms may be stated. 


(a) There are really two complete sets of reproductive organs and in one case 
see Fig. each of these is of the typical cruciferous structure. 
Fig. XCVII h of th f the typical f truct 


(b) Increase in the number of the reproductive organs is accompanied by an 
increase in the number of members in the protective organs. 


(c) The increase in the number of members of the reproductive organs is for 
the most part in the plane of division of the carpels, in other words, in the outer 
whorl of the calyx and its associated petals. 


(d) This is also the plane along which the separation of the reproductive 
organs has taken place. 


(e) This plane is the one which we have already shown in the statistical part 
to be the plane of greatest variability. 


(6) The gyncecium consists of two sets of two carpels within separate perianths 
but on one pedicel. 


In this group we reach the limit of variability in the material examined. In 
place of a single flower consisting of calyx, corolla, andrcecium and gynecium we 
actually find two complete sets of all these organs, on one pedicel (see Figs. CITI— 
CVIIT), while in one case (Fig. CIII) each of the two flowers has the typical 
cruciferous structure, so that were each of these separately examined it would 
undoubtedly be regarded as a normal flower. Yet we must bear in mind that, 
botanically considered, one flower and one flower only arises from a pedicel. Were 
this, therefore, an isolated example, and if no annectant forms existed, the departure 
might well be regarded as a “mutation,” but a consideration of the numerous 
variations which we have already considered, taken in conjunction with group 5, 
only serves to emphasise the fact that “the vertical plane which passes through 
the partition wall of the two carpels and consequently separates the individuals 
of the pairs of stamens in the inner whorl and passes through the centres of the 
sepals of the outer whorl of the calyx is a plane along which this flower is in a 
state of flux and is the plane in which it is probable that the flower has changed, 
and is still changing, from some quite different ancestral form.” 


J. J. SIMPSON 251 


VII. SUGGESTIONS FOR FUTURE STUDIES IN THIS PLANT. 


It must be very obvious to anyone who has perused this paper that the results 
which might be obtained from a study of this plant are by no means exhausted. 
An attempt, however, has been made to interpret the variability in its flowers, 
both from a morphological and an evolutionary standpoint. Studies of a different 
nature might be undertaken in order to test the results obtained, e.g. : 


(1) What is the degree of fertility in the flowers of this plant? For this 
purpose it would be necessary to find the percentage of flowers which produce 
fertile seed. 


(2) What are the variants, if any, which are associated with infertility ? 


(3) What are the characters of the flowers which are produced from the seeds 
of the different variants? If seeds selected from the different variants were grown 
separately and self-fertilised, one could trace the variations in the flowers of the 
next generation and see to what extent the different variations were transmitted. 
This study is capable of much elaboration and is one which would be fraught with 
great possibilities. It seems to involve a satisfactory method of determining how 
far these variations are concerned in plant economics, and also to what extent 
they have been instrumental in the evolution of the Order Crucifere. 


EXPLANATION OF FIGURES 8, 9 AND 10. PLATE I. 


FIGURE 8. 
(a) Typical stamen (outer whorl). 
(b) Stamen with half-length filament and complete anther (outer whorl). 
(c) Typical stamen (inner whorl). 
(d) Non-chorised stamen with two complete anthers (inner whorl). 
(e) Stamen of inner whorl with two complete anthers but only chorised in the upper half. 


FIGURE 9. 


) Aborted stamen of outer whorl, i.e. filament with no anther. 

(b) Absence of stamen in outer whorl. 

(c) Full-length filament in inner whorl with no anther. 

(d) Half-length filament in inner whorl with complete anther. 

(e) Half-length filament in inner whorl with no anther, 

(f) Non-chorised stamen in inner whorl with half-length filament but with two complete anthers. 


FIGURE 10. 
) Normal sepal. 
(b) Sepal divided almost to the very base. 
(c) Sepal completely divided into two distinct sepals. 


(d) Sepal absent. 
(e) Normal petal. 
(f) Petal divided almost to the very base. 


(g) Aborted petal. 
(h) Petal absent. 


Outer Whorl of 


Andrcecium. 


Inner Whorl of Calyx. 
Number of Sepals 


Contribution to a Statistical 


bo 
On 
o.8) 


TABLE I. 
Outer Whorl of Calyx. 


Number of Sepals. 


0 1 

1 3 

y 1803 | 

8 O | 

J 6 
Totals 


TABLE IIT. 


Outer Whorl of Calyx. 
Number of Sepals. 


ab Totals 
3 5 0 5 
5 Hh 1 85 
Bs y) 1713 | 
ES Mg 3 
a 9g } 7a 
<qiol est ‘ 
7, |Totals 1813 
TABLE V. 
Outer Whorl of Calyx. 
Number of Sepals. 
7 = s =a = | 
Totals 
ory 38 
L 5 367 
S| 6 1367 
3 | c 19 
Se 9 1 
Se at0 2 | 
= | 12 Om 
af | 13 1 
|. 4 1 
18 2 | 
Totals 1813 


Study of the Crucifere 


Corolla. 


Inner Whorl of Andrcecium. 
Number of Petals. 


Corolla. 
Number of Petals. 


Number of Stamens. 


TABLE II. 
Outer Whorl of Calyx. 
Number of Sepals. 


QD 
0 
1 
2 
y 
5 
6 
Totals 
TABLE IV. 
Outer Whorl of Calyx. 

Number of Sepals. 

0 dl 2 3 | 4 | Totals. 
Cel a Tel 
2 |= | 6 |.) 24s eae 
3 | 2] 992 | — | —} .994 
4 | 2 | 39:| 1399 | — |=") ago 
5 2 25]; 1 | — 28 | 
6 —|— 12) — | — 12 
Sees Tye | a 4 | 
9 1 1 

10 —|— os 1 | — 1 | 
1 f—|—| — 4a} 2 | 
Totals] 2 | 50 | 1753) 6 | 2 | 1813 | 


TABLE VI. 


Inner Whorl of Calyx. 
Number of Sepals. 


DAW Se *M*HS 


Totals 


S 
Nw 
w 
eS) 


4 {Totals 


Outer Whorl of 


Outer Whorl of 
Andrceecium. 
Number of Stamens. 


[=| 

oO 

ofS 
a x | 

ess 

BO 
a | 

@3 

aq 2 

I 

= 

Z, 

DB 

g 

ga 

ee 

eas 

23 

oH 

oO 

<q 2 

q 

fos} 

Z, 


TABLE 


VII. 


J. J. SIMPSON 


Inner Whorl of Calyx. 


Number of Sepals. 


cS) 


Q 
ov 


TABLE IX. 
Inner Whorl of Calyx. 


WOBANADAW 


of 4 


3 | 1803 


Number of Sepals. 


4 Totals’ 


Corolla. 


TABLE 
Calyx. 


Number of Sepals. 


| 7 
7 |Totals 


luner Whorl of Andreecium. 


Number of Petals. 


y 
L 


Inner Whorl of Andrcecium. 


ber of Stamens. 


Num 


Number of Stamens. 


> MA SS WH 


~ 


Totals 


259 
TABLE VIII. 
Inner Whorl of Calyx. 
Number of Sepals. 
g 3 4, | Totals 


o| 1 


| 
| 
| 
} 


Number of Sepals. 


a 
| 
aay ties 
= 
| 
1 3 | 1803 | 0 | 6 


TABLE X. 


Calyx. 


| 


'Totals} £ 


TABLE XII. 
Calyx. 


Number of Sepals. 


Inner Whorl of Andreecium. 


Andreecium. 
Number of Stamens. 


Number of Stamens. 


Contribution to 


TABLE XIII. 
Calyx. 


Number of Sepals. 


TABLE XV. 
Corolla. 


Number of Petals. 


Totals 


1 
Q 
oS 
J ie 
‘5 2) 
6 — | 
8 Ik 
9 Lem 
10 1 
VU 2 
Totals 
TABLE XVII. 
Outer Whorl of Andrcecium. 
Number of Stamens. 
: —. 

Totals 
eh ] 
5 wo 
S81 2 30 
es a 3 294 
icp 4 1440 
a a oO 28 
LSC BG 12) 
Sa| 8 A 
eS 9 1 
= © 
Boel ALO 1 | 
SAl ty 2 
= = 

Totals 1813 


Outer Whorl of 


Inner Whorl of Androecium 


Andrcoecium. 
Number of Stamens. 


aye 
Number of Stamens. 


Andrcecium. 
Number of Stamens. 


a Statistical Study of the Crucifere 


TABLE XIV. 


Corolla. 


Number of Petals. 


TABLE XVI. 


Corolla. 
Number of Petals. 


5 


6 |Totals 


_\Totalsy 4 | 3 | 18 34| 8 | 4 | 1813 
TABLE XVIII. 
Inner Whorl of Andreecium, position 3. 4. 
Number of Stamens. 
4 Totals 
ng 1 
i SuL7/ 
2 1446 
3 37 
4 0 
5 3 
6 0) 
ff 2 
Totals 1813 


J. J. SIMpson 261 


TABLE XX. 
Outer Whorl of Calyx. 


TABLE XIX. 
Outer Whorl of Calyx. 


Number of Sepals. 


Number of Sepals. 


Inner Whorl of Andrcecium, 
position 3. 4, 
Number of Stamens. 


position 3. 4. 
Number of Stamens. 


Inner Whorl of Andreecium, 


a 3 Totals 

los FS 
1 =. @ 0 1 
2 eS 8 i 317 
3 tion] 2 1446 
4 oe oie 37 
5 BS y| 4 
6 eS es 5 
7 BE oe 7 

nO 

1813 = Totals 
— be 


TABLE XXI. 
Inner Whorl of Calyx. 


Number of Sepals. 


4, \Totals 


TABLE XXII. 


Inner Whorl of Calyx. 


Number of Sepals. 


Gaieeatiy cor ies q 0 | 1 2 | 31] 4 |Totals 
=| 
Saar _ 25} a 
it || oe iis ea ee | 8 8] 0 
2 — 3 1750 | -— 1 1754 oR) a y 
Ba fe |e ha One et 32 
ep re pu |e Us ese WS 
i (ae ale | ot 2 Oe: |e 
ee ey ol Be hs 
7 2 2 hore) tab 
H cS 
Totals} 1 | 3 | 1803] 0 | 6 | 1813 3 4 
: A Totals 
TABLE XXIII. 
Corolla. 
Number of Petals. 
5 
a 
oO al SS, 
PE eecead 
aw & 2 
<a: 3 
Sa we] 4 
eins} A 5 
BS 5 : 
ah 2 6 
Fag) 7 
mn OA 
5 Totals 
S 
— bi — 
Biometrika x 34 


Inner Whorl of Andrcecium, 


position 3. 4. 
Number of Stamens. 


2 Contribution to a Statistical Study of the Crucifere 


TABLE XXV. 


Outer Whorl of Androecium. 


Number of Stamens. 


g 
3 
D ; 
= 
=| co) 
Bo g 
. @ 
“ls ae 
M 
Soa 
OB w 
ae) 
mer 
ee 
on v 
E a'g 
bolts 
3 
8 OG 
=} 
q 
ml 


TABLE XXIV. 
Corolla. 


Number of Petals. 


NWA AK O89 WHO 


6 | Totals 


TABLE XXVI. 


Outer Whorl of Androecium. 


Number of Stamens. 


s 
SP RAAB WON 
a 


a2] 


QD 


4 


3 | 4 |Totals } 
ee 
42 Some aa0 
1754 go g| 1 
9 as#| ? 
Fleeccts cul 
ae Seeds 
0) Sa ® 5 
2 cae | 6 
Fe seal. a7, 
Prt 4 Totals 


J. J. SIMPson 


PLATE I. 


Aborted 


Peduncle ap) 
A 


aa” r 


/ 
/ 

‘ 

t 

' 


a — Petiole 


\ 


Fig 3 
. A 


Via 
mene D 3 ane 
D' 

4 See ae, 

Cc : | 

fig 10 


Fig 1 


ies 


) 
0 2 
CTTTIATT thos 


lone} 
‘@ 


Fig 12 


384—2 


uribution to a Statistical Study of th 


265 


OO: © © 


=~ ow ~~ 

OOO IO, 
PS WZ NT 
OO: © © 
Ae, NN 


OO © © 


266 Contribution to a Statistical Study of the Crucifere 


& 


© 


AIOX II0x TIox ITXXXT IXXX1T XXXT 


aN YEN OAL Oda 
o* ’ ’ % 
Zz e ty *, 
it Fide < f “2. ON 
= a. ™: a ‘ 6, 
N xX ie OX + \ a % 
pila gy a Sete s 
Ba % Oo : stash 38 : 
ote A, ers ee 0 <> xi 
A oper 
5 . 


ee ee 


‘TOX OX 


vay, 
at ", 
% 


HIAXX'T ITAXX'T 


tt llliay, | 
Oh 4, 
. 


. 
yy o yy fe, 


TIAXXXT 


J. J. StmMpson 


AXXXT 


nnttlltyyy 
<8 4, 
: "4, 
¢, 
a 


<1 
+ 


YY 


‘XJ FLivIg 


TITA @LVId 


268 Contribution to a Statistical Study of the Crucifere 


CVIII A 
CVIII B 


Sree 


OOOO 

OOOO 
NNN 

O@™ © © 

ae 

SE Ne 


a 

> ‘it\ = 

2 peo tft) [Ee oe 
CF AW 


~ —=a~ 
@) ©) 
ee ot Nee 


ae 


CIV B 
CVI B 
OVIIB 


CIII A 

Clu B 
CVI A 
CVII A 


XCVI 


NOCHMALS UBER “THE ELIMINATION OF SPURIOUS 
CORRELATION DUE TO POSITION IN TIME OR SPACE.” 


Von O. ANDERSON, St. Petersburg, RuBland. 


1. Im Aprilheft der Biometrika, hat “Student” gezeigt*, daB das von Cave und 
Hooker vorgeschlagene Verfahren, den Korrelationskoeffizienten zweier oscillieren- 
der Variablen durch Berechnung erster Differenzen (also durch Ersetzung der 
Reihe a, 2,... %,, durch die Reihe: A’a, =x, — x, A’x,= a, — &3,... An. = Bn —4n) 
vom evolutorischen Element zu befreien, eine Verallgemeinerung zuliBt. Das 
Verfahren ist nimlich, streng genommen, nur dann richtig, wenn die evolu- 
torische Komponente durch eine lineare Gleichung darstellbar ist. Findet 
letzteres nicht statt, kann also, z. B., jener nur eine parabolische Gleichung 
hoherer Ordnung geniigen, so mu8 man zweite, dritte u.s.w. Differenzen nehmen 
(also statt A’s,, A’x,... nehme man A”z, = A’a,— A’x,, A” x, = A’x, — A’as, ... etc.) 
und danach Korrelationskoeffizienten berechnen. Letztere kénnen bald einen 
konstanten Grenzwert erreichen, der das gewiinschte Resultat darstellt. 


Unterzeichneter ist schon vor etwa 2 Jahren zu ahnlichen Schliissen gekommen. 
Durch von ihm unabhangige Griinde wurde er aber bis jetzt vom Drucke seiner 
diesbeziiglichen Schrift abgehalten. Da er bei seiner Untersuchung Wege ein- 
schlagt, die von denen des “Students” sehr verschieden sind, und auch zu manchen 
Schliissen kommt, welche letzterem unbekannt geblieben zu sein scheinen, so 
kénnte vielleicht eine kurzgefaBte Darstellung der wichtigsten Resultate seiner 
Untersuchung fiir die Leser der Biometrika von einigem Interesse sein. 


2. Methode. Die englische statistische Schule vernachlassigt in ihren Unter- 
suchungen ein Verfahren, das von russischen und deutschen Gelehrten oft ange- 
wandt wird (Tchebycheff, Markoff, v. Bortkiewicz, u.s.w.) und neben groBer Strenge 
und Exaktheit noch den Vorzug hat recht elementar zu sein—die Methode der 
mathematischen Erwartung némlich, Mathematische Erwartung einer Grofe (4) 
heiBt bekanntlich soviel als das Produkt aus dieser GroBe und ihrer Wahrschein- 
lichkeit (w), also Aw. Wenn eine Variable eine Reihe einander ausschlieBender 


* Biometrika, Vol. x. Part 1, 8. 179, ‘‘ The Elimination of Spurious Correlation due to Position in 
Time or Space.” By ‘‘Student.” 


Biometrika x 35 


270 «=©Die Verallgemeinerung der Cave-Hookersche Methode 


GréBen annehmen kann, so ist deren math. Erwartung als die Summe der Erwar- 
tungen aller dieser GréBen definiert. Wir werden hier die mathem. Erwartung 
iiberall durch das Symbol #( ) bezeichnen. (A) ist also, z. B., gleich Aw. 


Die hauptsichlichsten Satze tiber mathematische Erwartungen diirften als 
bekannt angenommen werden. Um aber die Nachpriifung der Formeln dieser 
Schrift zu erleichtern, werden wir hier die fiir uns wichtigsten Satze noch kurz 
andeuten : 


(Ql) H(@t+y-2ztu-t...)=H(a)+HYy)-H(a)+H(u)-H).... 
(2) Wenn uw, y, z,... von einander unabhingig sind, so ist 
H(e.y. 2... =f (a) Ee EZ) sea. 
(3) EH (k«)=kK (x), wo k const. ist; und daher auch : 
iE (hy) = he: 


(4) Wenn eine Variable X die Werte 2, 2,...%, annehmen kann, so ist 
die Wahrscheinlichkeit W, daB die Differenz «;—H(«) zwischen den Grenzen 


—aV HE (a?)—[E(2)P und +a V# (a*?)—[E(a)f enthalten sei, gréBer als 1— = 


ae 
wo a gréBer als 1 sein mu (ein Theorem von Tchebycheff). 


In unserer Untersuchung werden wir iiberall statt des wahrscheinlichsten 
Wertes einer Gréfe deren mathematische Erwartung berechnen. 


Bestimmen wir zuerst, wie sich der Korrelationskoeffizient zweier oscillierender 
Reihen verhalt, wenn man deren GréBen durch Differenzen Ai. (Naa Nace 
A’y, A”y, A’’y, ... ersetzt, und darauf untersuchen wir die Frage von den Grenzen 
der Anwendbarkeit der verallgemeinerten Cave-Hookerschen Methode. Um Raum 
zu sparen, werden wir nur die endgiltigen Resultate der Berechnungen anfiihren, 
ausgenommen die 3 ersten Formeln, deren Bestimmung als Beispiel der Rech- 
nungsmethode dienen mige. 


3. Definition. Unter einer oscillatorischen Reihe werden wir eine solche 
Reihe 


XH, Ua, Uy, eee, Uj, ove, Ln 
verstehen, bei der 


HE (a) = H'@) =... = Ea) =. =f (a) =) const. 
und alle einzelnen Glieder von einander vollig unabhingig sind, so dab 
Ef (#;4;) = E («;). E (a), wobei 7+ 7. 


Solchen Bedingungen wiirde zwm Beispiel eine Reihe geniigen, deren Glieder 
die Resultate emer Versuchsreihe mit konstanter Wahrscheinlichkeit darstellen, 
etwa Resultate von Ziehungen aus einer Urne mit m weiBen und n schwarzen 
Kugeln. 


O. ANDERSON 2M 


4. Mittleres Fehlerquadrat. 
Bezeichnen wir a;—H(#) durch &;, so ist 
KE, = E (a; -— E (a)] = # (a) — EH (2) = 0. 
Da die einzelnen & von einander vollig unabhéingig sind, so ist 
E (&&) = # (&). H(&) = 0. 


Das mittlere Fehlerquadrat der Reihe & ist gleich 


Seine mathematische Erwartung wollen wir (nicht ganz in Ubereinstimmung 
mit der iiblichen Bezeichnung) o,? nennen. 


1 n nu 
eran PO 1 [Se] 28 [Se]. sereatoo 
= E (a*) -(H (@)P, 


ein Ausdruck, der oben im Satze 4 (§ 2) unter dem Zeichen der Quadratwurzel 
steht. 


[St-2@r| {ee| 
Andererseits ist aber 1 (Gone gleich A oe | und daher 
o2= B(E). 


5 nN 
Untersuchen wir den Ausdruck abs (a; —- My], wo M, das arithmetische 
ak 


iL 


Sa 


SAD 


Mittel der Reihe 2, also = bedeutet. Da 


Barr t+ (a) + et... +H (a) + En _ 


nN 


o— M,= HE («)+&- E;— M: 


n 
> &; 

(wenn man JM; fiir +— einsetzt), so haben wir: 
n 


E E (ee My» | ae 1, Be st my | _E E aoe oe] 
acl al 1 


See—2366)| 
=n B(E)— 0B (Me) =n (&) — nF | fant @)- B®) 


ie 
b[ Sq -any |= =H Be) 
if 


Um £(&) zu bekommen, muf man diesen Ausdruck durch (nm — 1) dividieren. 
35—2 


272 Die Verallgemeinerung der Cave-Hookersche Methode 


Ks ist also auch 


Um das Fehlerquadrat o*y,z fiir die erste Differenz A’xv zu erhalten, bertick- 


sichtigen wir, dab 
A’x; = (#i — Bir) = (Ei — Eis) 


und E (A'a;) = E (&:) — & (Ein) = 9. 
Daher haben wir: 
n= ; } n—-1 
[‘Stwn- Bay] ['S&- si 


aes H | im a eee n—1 


Sa | |S ee _2E Be fin | +E | 2e°]} 
n—l 1 1 2 


= +> (m1) B(E*)— 0+ (n-1) B(E)} = 28 (B). 


Ks ist also Can — on 
Nach demselben Rechenschema ergiebt sich fiir das mittlere Fehlerquadrat 


der zweiten Differenz A’x; der Ausdruck 60,2 


, dritten ms DG aes i 200,7 
» vierten a PN ie = 700,2 
2k! 
» k-ten . Aas, »» AA On: 
Wir konnen daher folgende Gleichung aufstellen 
re One = o'a'a ahs aN Se: CNL ra 7,0, 2 
Oy, = 2 = 6 = 20 = 70 pe are a sre eeeee (1) 
i k ) 
welche exakt ist, und folgende 
n n-1 n—2 n-3 TN n-~k 
3 (0;—M,) “SA'a? “LA's? “SA” ae "Sama SNC 
1 a 1 = u —_— —— u — 1 —— pe pe 
n—1 2(n—1) 6(n—2) 20(m—3) 7O(n—4) “Qk! k ay * 
TEE a 


welche nur anniihernd richtig’ ist. 


n-k n-k 
D> Atk) x2 D2) (A®e -M (h Ve 
5 5 1 J 1 A‘ ‘a, 
* Ks ist vorteilhafter yeaa und nicht —— zu berechnen. 
i :) raga ee 


pea 


—a 


O. ANDERSON 273 


LHYs 
Ss . 1 
5. Das mittlere Produkt mo 
Wenn zwei Reihen 
X, Xo, X3, shaiie, Xn» 
und Yuva Gas Uae Uns 


beide im Sinne des § 3 oscillatorisch sind, und eine Korrelation nur zwischen 
GréBen mit gleichen Indexen, also zwischen 2, und y;, « und y, 2; und ¥;, u.s.w. 
bestehen kann, so ist es leicht ersichtlich, daB 


E ((x— E@)\[yj— Ey}, =0, wenn i4j, 
Bezeichnen wir y;— H'(y) durch y;, #;— E(w) wieder durch &;, so finden wir 


leicht folgende Ausdriicke : 
Ei 
Pay =H | — =H (Ei), 


n 


| = E (Eni) = pay, 


n—l 
> A’a, A’ Yi 
1 
Pareny= 8 |—Gaa— | = BP 


[E@%— Me) yi- My) 
E | , 


ree a ee ee rs 


("S'a 8) op, A) yy 2k! 
Paw, Agee Nae hee 


Wir konnen daher wieder zwei Gleichungssysteme : 


_Pyraty Parcary Pavrra’y  Pavve atrry Pawa ay 
k! k! 
und 


nN 


n n—-1 n—-2 -3 
> [w—E(«)|[y-H(y)]) DA'aA’y, TAwA"y, & AGA’; 
1 1 1 


n—1 "7S © M2)" = 20 = 3) 
"S Awa, Ativ)y, AM x Ay, 
= t= ———— =p (2a) 
TO(n—4) 7" 2k! eS 
Et kl yi i” —k) 


aufstellen, von denen das erste exakt und das zweite angenihert ist, und welche 
den Gleichungen fiir o,? (also auch fiir 0,2) des § 4 genau analog sind. 


274 = Die Verallgemeinerung der Cave-Hookersche Methode 


6. Das Fehlerquadrat der Fehlerquadrate. 

Betrachten wir jetzt den Bereich der Schwankungen der GréSen der Systeme 
(1a) und (2a) um deren mathematisch zu erwartenden GréBen in (1) und(2). Mit 
anderen Worten, gehen wir (mit Riicksicht auf Satz 4 § 2) zur Darstellung der 
math. Erwartung der Fehlerquadrate der genannten Groé8en iiber. 


S [v; — £ (x)P 4 2) ]2 
Fiir 2 7 ergiebt sich das Fehlerquadrat BSD) a (é )] P 
3 [2:— Ma} 
Fiir * : ergiebt sich das Fehlerquadrat Ee ACO ap aes] : 
al n n(n—1) 
S [A’ajp 
Fir DED ergiebt sich das Feblerquadrat 
(2n — 8) (EB (&) - [A (E)P}+2(a— DBE)? 
2(n—1) , 
"S [Av aiP 
Fiir aCe 2) ergiebt sich das Fehlerquadrat 
{P= 
(9n — 23) (H (E+) —[# (&)P} + '7n — 42) LE (&)P 
9(n —2)? , 
n—-k 


S(A® x; 


: apeal 
Und endlich fiir Ohi 


aan (n—k) 
Tere ae Oe 24) HE) — (BEN) + 4 EYP LAS (2+ 1 
+ A? (n—2kh4+2)+4+ A)? (n—2k4+3)4+...+ 472 (n—2k4+h)] 
+ 2B¢ (2 (8) —[H(E)K +8 [BOR Bat Bet + Bea) 


ergiebt sich der recht komplizierte Ausdruck : 


Wenn Jy, b,, bs, ... by die Koeffizienten der Zerlegung des Binoms (1 +1)* 
k(k—1) 


darstellen, also 6,.=1, b,=hk, b= 1g? USW., 80 ist hier 


Qh! 2 
Ag=(b2+b2+b2+... +02" = | eral 


ht 7 
Aj? = (doby + bb, + babs + 0. + Dp adn? = Fsivese ; 


Qk! 2 
A? = (by bz + bib; +... + dy ob)? = Feeeerd 


Smet w ee ee mere e eer e ee eaee eee essere eeeereeeerereseseeeeeeeeeseerese 


' 2k! 2 
A? = (bb; ar bibj45 ap ooo ar by—jbx)? = Feel > 


Cr ee i ee 


O. ANDERSON 275 


(Oy) (Oe Or) + (by Oy + 6") Fe. + (Oy + be + bP +... + Be 1)%, 
B? = (by bi)? + (bob: + br be)? + (do bi + 1 02 + 6565)? +... 
ar (by b, ar b,b, + bibs <font Op-20R-a) 5 
Be = (b,b:)* + (bob; + Bibi gs)? + (Bob; + D1 Di4a + Dabiye)? +... 
hy (by b; a b, bein + babi+2 tee + Dee Ueen)e 


ee cy 


By, a (bo by). 


Wenn die Verteilung der « (und dies ist der fiir uns interessanteste Fall) 
“normal” ist, so kénnen obige Formelnu betrachtlich vereinfacht werden. Da man 


in diesem Fall # (&) gleich 3 [H (&) oder 30,* setzen kann, so haben wir: 


3 (a — E («)P 
1 


4 
Fir << das Fehlerquadrat 2 (vergl. Biometrika, U1. p. 276). 
nv ; 
> («; — M,) Qa! 
” ial ee 1 ) > he 1 . 
n- 
var Cle Dae oder angenéahert eee 
»” “2(n—1). ” ” (n = 1) ge ’ahner rae 1 ‘e 
SN 
ae): (85n — 88) 04 do! 
> 6(n—1) P i 9 (n— 2) ‘ “4 n—-2° 
n—-3 
=f wy, .\2 
PS eo) (231n — 843) oy! Sox! 
e201 = 2) a 7 50 (n — 3)? 4 i, n—3° 
n—k 
> (A® x; 
Fiir aa endlich kann man das Fehlerquadrat in solcher Form dar- 
ma —*) 
stellen : 


@ oo Nin ~ k) + 2(n—b—1) (Epi) +2@-k-2) C Ses ») 


+2(n—k—3)( k. (ke —1).(k— 2) ee 


(k +1). (E+ 2). (E +3) 

alle OSCE 

(h+1). he +2). (6 +3)... 4g)) 
k.(b—1).(k—2)...2.1 yt 

(k +1). +2). (6 +3)... (+h) 5° 


+2(n-—k—jJ) ( 


+2(n—b—b) ( 


276 Die Verallgemeinerung der Cave-Hookersche Methode 


Ks ist also klar, daB zusammen mit dem endlichen Differenzieren die Unsicher- 
heit der Bestimmung von a,” stetig wichst, anfangs etwa im Verhaltnis 


Nee NTR De ISY Ae 
7. Das Fehlerquadrat des mittleren Produktes. 
s Ei 2 7 2 
urs == orpicbe sich das imonleriudine eerie ete 
S (= Ma) (ys — My) 
Fiir + ear ergiebt sich das Fehlerquadrat 
B (Epi) — (BE P| Eis) P+ on2o,/ 
n n(n —1) 
n—-1 
> A'a;A’y; 
Ace aa . Pee Pee f 
Fir SON ea ergiebt sich das Fehlerquadrat 
(2n — 8) {EB (E2r2) — LE (Ea +(e — 1) (LE EW) P+ 02207} 
2(n —1) j 
n-k 
> AMa, Ay; 
Im allgemeinen Fall Soi = ae erhalten wir fiir das Feblerquadrat folgenden 
Sh) 


kik! 
Ausdruck : 


Timp {48 — 28) UE EWE) — LE EP 


+ 2 {LE (Eh) P + 0,70,7} [A (mn — 2h +1) 4+ A? (n-— 2k 4 2)4+ ... + Ae (n— 2k +k)] 
+ 2B? (E (E2yr2) — LE (Ei) P} + 4 (LE (Esti) P + oe2o,?} (BP + Be +... + Bi} , 
wo die Koeffizienten A,’, A,’,... B,?, By, ... dieselbe Bedeutung haben, wie in § 6. 

Wenn 2; und y; einander vollkommen gleich sind, so ist 

[E (Emi) P={H(P)Paorts HEM A)=H EA); Foy = a7", 
und obiger Ausdruck fallt mit dem in § 6 zusammen. 

Fiir den Fall der “normalen” Verteilung kénnen wir auch alle diese Ausdriicke 
betrachtlich vereinfachen, besonders wenn wir # (&;;) und #(&?.?) als Funktionen 
von rz, darstellen. Olbne aber hier darauf einzugehen, wollen wir jetzt tiber den 
Korrelationskoeffizienten ins Klare kommen. 

8. Definition des Korrelationskoeffizienten. 

Der Korrelationskoeffizient R wird gewoéhnlich nach der Formel 

2 (a — Ms) (yi— My) 
R= Z berechnet. 


/S@- May 3 i,y 
1 1 


-O. ANDERSON 277 


Zu welchem Ausdruck ist er nun als empirische Anndherung aufzufassen, 


S (a; — Mz) (yi — My) H E (a; — Mz) (yi - m,)| 
1 e 1 


2 


oder zu 


zu = : : 
r/ See —M,) S (y; - M,) Neo tas M,)"] E(S (y:— My] 
1 1 1 1 


Beide Formeln sind durchaus nicht mit einander zu identifizieren und fallen 
nur in erster Anndherung zusammen. Da die zweite aber bedeutend leichter zu 
handhaben ist und dies auch mehr den tiblichen Rechnungsmethoden der englischen 


Schule entspricht, so definteren wir ry, als 


EK E (a; re M;) (Yi -_ | 
1 


ve E E ee Mz) | E E (y;— My} | | 


E : p : 
Anders ausgedriickt ist yy = Pay , WO Dzy, Gz, Fy die Bedeutungen haben, 


Oxy 
welche wir ihnen oben in §§ 4 und 5 beigemessen haben. 


9. Das Verhalten des Korrelationskoeffizienten zweier oscillierender Rethen 
x und y, wenn man deren GroBen durch Differenzen ersetzt. 


Fiir die k-te endliche Differenz von z und y haben wir 


an ee : . Qe! 
y a A! ao, A 4 } — i) 
- ( 1 fe _ Paha, AMy kt. ey Pe 
Afiz, AMy eg n—k nk = NE 2 2 = 9 ye 
SR ae id A TAlkin + T Ah) 2k! 2k! 
E\ > Ae2|.H| & A®y2 Ax ly 2: ee 
paar ae lel lo” EL Ri’ 
So Gee 
Tar, Ay = Guan oy Pay: 


Wir haben also ganz allgemein das genaue Resultat : 
Try = Ya'e, A’y = TA" x, A’y — TA" x, A”'y a T atk), Ally: 


Da aber diese r unbekannt bleiben und wir fiir ein beliebiges Taide, A@y DUT 
n—t . 
AMZ, AMY, 


dessen Anniherungsformel a, a —— kennen, so miissen wir 


wiederum feststellen, inwiefern man sich in der Praxis auf die Ubereinstimmung der 
empirischen Koeffizienten mit deren mathematischen Erwartungen verlassen kann, 
wie gro$ also die Unsicherheit ihrer Bestimmung zu schatzen ist. 

Biometrika x 36 


278 Die Verallgemeinerung der Cave-Hookersche Methode 


10. Mitileres Fehlerquadrat des Korrelationskoeffizienten der endl. Differenzen 
zweier Rechen. 


Aus der Formel &;=—-~ : , kann man folgenden Ausdruck 
a 
vi 5, Aa > A by 
1 1 
ableiten : 
n—k n-k n—-k 
> AMa A® y; > A a2 > Aye 
- Parc k : or Alk : o 
ae = ‘bt i 
IR Ty ay Ac, Ay ni Aa m Aly 
- —— ee, a) eT) aT aa ate p) 
ee Pawea®y 20° Aix 20° Amy 


der nur in erster Annaherung und, wenn alle 4 Briiche des Ausdrucks echte sind, 
richtig ist; dies ist bei groBem n der Fall. Und ferner ergiebt sich daraus die 
Formel : 


+6 (1 =? xy)? ; Pe ee iden: ene k.(k—1) 2 
Gas sisi i Ol eel =e " 2) ey cea 


ig (Pan (er 
+2(n— 22) agen (k +2). Gaal ox 
k.(k—-1).(k—-2).. ; 
+2(n—k— ®) (ary. (K+2).(k+38).. 21 
"3 (Ama) 


7 
ii a = k) 


(vergl. dazu die Formel fiir in § 6). 


Die Formel fiir on ist immer nur dann giltig, wenn man 


ea) ee (Eepe) —[H (Ea? , HE) — [LA (e)P ie E (p*) — [EB Gp?)P 


n LE (Evi)? 4[H (é)] 4 [EB (p*)P 
_ Ege) — Hkh) EE) BG) = BE) BOY) 


_Beye)- E©). st 


2H (&*). Bp’) 
gleich Sal evar setzen darf (vergl. Biometrika, Vol. 1x. p. 4). 


Aus der Formel fiir oR erhalten wir: 
byes (1 = xy)? 
Ry n ; 
jar 1 Lata) 3n a 4 
Ry n-1 °2(n—1)’ 
"i (l=) 7 80% 88 
Rs a ee 718 (m=2)- 
_Gd=—ry)? 231n — 848 
Ryn —3  * 100 (n—8)’ 


u.S.W. 


2 


O. ANDERSON 279 


Die Fehlerquadrate der Korrelationskoeffizienten aufeinanderfolgender Diffe- 
renzenordnungen verhalten sich folglich zueinander ungefahr wie 2:3:4:5.... 


Die Unsicherheit wachst also mit zunehmender Differenzenordnung etwa im 
Verhaltnis 
N DEEN! O uA) AES Nf Dyions 
ll. Korrelationskoeffizient zweier zusammengesetzter Rethen, die aus oscillato- 
rischen und evolutorischen Elementen bestehen. 


Da “Student” diese Frage treffend dargelegt hat, konnen wir uns kurz fassen. 
Wenn wir in Betracht ziehen, daB fiir uns die evolutorische Komponente einer 
Reihe schon dann in der Praxis verschwunden ist, wenn sie im Verhialtnis zur 
oscillatorischen Komponente so klein geworden ist, daB sie nur die 3, 4", u.s.w. 
Zahlenstellen des Ausdruckes fiir R beeinflussen kann,so kommen wir zum SchluB, 
da8 nicht nur Komponenten, die durch eine Parabel héherer Ordnung darstellbar 
sind, sondern auch solche, denen nur transzendentale Gleichungen (z. B. Sinus- 
reihen) gentigen, beim endlichen Differenzieren eliminiert werden. Ja mehr noch, 
man kann beweisen, daf iiberhaupt alle mehr oder minder “ glatten Reihen,” alle 
bei denen eine geniigende positive Korrelation zwischen den Nachbargliedern 
bemerkbar ist, fiir die Praxis beim endlichen Differenzieren verschwinden. Das 
verallgemeinerte Cave-Hookersche Verfahren ist daher augenscheinlich ein sehr 
universales Mittel, die Korrelation oscillatorischer Elemente aus zusammengesetzten 
Reihen herauszuschiilen. Es hat aber einen Haken, auf den hier noch hingewiesen 
werden muf. 


12. Kann man aus dem Verhalten der Rethe R,, R,, R,, ... Ry bestimmen, ob wir 
den Korrelationskoeffizienten rein oscillatorischer Reihen vor uns haben? “Student” 
scheint zu glauben, daB8 wenn irgendein Rf; seinem Vorgiinger R;, gleich ist, wir 
es sicher mit dem Korrelationskoeffizienten oscillierender Elemente zu tun haben. 
Vor einem solchen Schlu8 ist nachdriicklich zu warnen. Wie es meine (fiir diesen 
Artikel etwas zu langwierigen) Berechnungen zeigen, kénnen zwei Nachbarkoefti- 
zienten h;, R;, auch bei stark evolutorischen Reihen einander ungefahr gleich 
sein, und die Wahrscheinlichkeit eines solchen Zusammentreffens ist gar nicht sehr 
gering einzuschatzen. Nur wenn wir, von irgendeinem #; angefangen, immer 
dieselbe GroBe fiir R erhalten, also Aj = Rji.= Rj4.= Rj4;=, wird ein solcher 
SchluB berechtigt sein, und je linger die Reihe gleicher R, desto wahrscheinlicher 
wird dieser Schlu8. 


36—2 


STATISTICAL NOTES ON THE INFLUENCE 
OF EDUCATION IN EGYPT. 


By M. HOSNY, M.A., B.Sc. 


The statistical returns for Egypt are—as compared with European data—still 
in a somewhat elementary stage. Age-distributions are of very little value, and 
in the case of infantile mortality we have only information for certain towns. 
Further, in the larger towns there is a considerable cosmopolitan element, which 
gives them a widely different character from the often sparsely populated rural and 
desert districts. Education is not compulsory, and schools and literacy are largely 
confined to Cairo, Alexandria and the Canal Government, even when we exclude 
all foreign scholars. In the same way criminality* preponderates, in an inverse 
order it is true, in these three districts, but it is not absolutely certain whether this 
is due to their more efficient policing, to the presence of more foreigners, or to a 
real absence of crime in the rural populations. Crime does not appear to arise in 
Egypt from poverty or drunkenness, two of the main factors of its origin in 
Western Europe. The criminal, indeed, is rarely habitual; he is an amateur, 
rather than a professional, and criminals are more often well-to-do, their crimes 
arising from motives of revenge or passion. 


The fact that criminality in Egypt is highly correlated with literacy and 
scholarship would be noteworthy and might possibly be used as an argument 
against education, did not the association of crime and education arise from the 
prevalency of both in the more populated districts, where again we find the 
greatest abundance of foreigners. Naturally such questions arise as: 


(i) Are the foreigners—and if so, which section of them—to any extent 
responsible for the prevalence of crime in the districts frequented by them ? 


(ii) If we allow for urban conditions, will there still be found a high asso- 
ciation of crime and education ? 


It is perfectly easy to obtain from the Egyptian Census-. we used that of 
1907—the number of foreigners of each denomination in the various Egyptian 


* We understand by “criminality” in this paper, not commission of but conviction for crime. 


M. Hosny 281 


governments. The only difficulty here was the presence of British troops in 
Cairo and Alexandria, which placed that nationality in an anomalous position. 
These were estimated approximately and subtracted. The following groups of 
foreigners were then dealt with: (a) Ottomans, (b) British subjects, French, 
Austrians, Germans and Russians*, (c) Greeks, (d) Italians. The Greeks and 
Italians were separated from the general European group (b), because they 
are largely differentiated, the Greeks being frequently small traders and the 
Italians often manual workers. Their large numbers also justified a separate 
classification. 


Table I gives the foreigners per 10,000 in the 17 Egyptian districts we 
were able to deal with. It will be noted that the Greeks far outnumber other 


TABLE I. 


Foreigners per 10,000 and Population per sq. kilometre. 


Europeans other | Population 
Governments Ottomans | than Greeks and Greeks Italians | per sq. 
Italians | | kilometre 
Cairo .... 0 a 453, 312 | 298 204. 6060 
Alexandria oe god 661 514 745 482 6780 
Canalt ... Bas aon 416 583 846 445 7666 
Beherat ... Si hee 43 23 31 11 178 
Charkieh ae ae 29 5 24 | 1 257 
Dakahlieh and Damietta 18 5 | 18 3 346 
Gharbieh 2 0) g 0) 226 | 
Kalliuhieh 8 3 13 2 | 469 
Menufieh 2 1 7 0 | 618 
Assiutt ... + gz 3 il | 454 
Assuan ... he 8 5 13 | 4 | 533 
Beni Suef Aeneas 15 6 9 1 | 351 
Fayoum ... 10 3 4 0 | 255 
Gerga 2 0 2 0 532 
Guizeh .. 6 6 4 33 447 | 
Kenat ... 5 4 4 2 339 
Miniat 10 5 7 1 458 | 


foreigners, but that all foreigners are concentrated in the Cairo, Alexandria and 
Canal governments. 


It was far more difficult to obtain a measure of urban conditions. We had 
to take very rough measures of the density of the population, because the limits 
of certain areas are too vaguely defined to be of any service. El] Arish has been 
excluded from the Canal district, Suez and Sinai have also been excluded as there 
is no enumeration of them with respect to criminality, literacy and scholarship. 
These densities, w.th such value as they have, are given in the last column of 


Table I. 


* The contributions from other smaller nationalities were omitted. 
+ Various approximations and omissions occur in these cases in obtaining density. 


282 Statistical Notes on the Influence of Education in Egypt 


Table II provides the number of male criminals per 1000 of the male popu- 
lation, the literacy or number of male persons able to read and write per 100 of 


TABLE II. Hducational and Criminal Indices. 


| Male | Titer Male Scholars 
Governments Criminals per : er acy. 5—19, per 100 

| 1000 males | P* 100 males boys of those ages 
Cairo ar er “ae 12°90 28°03 30°20 

| Alexandria ae aA 14°15 | 30°09 19°99 
Canal and E] Arish ... 22°30 23°39 8°54 

| Behera ... ae Sac 5°30 | 9°29 1-01 

| Charkieh aie ae 4°20 9°09 1°66 
Dakahheh and Damietta 4°35 5 {salts} 1°76 

| Gharbieh ; a 5°65 8322, 3°04 

| Kalliuhieh a fs? 6°65 8:13 1°39 

| Menutieh bee ae 3°85 8°45 1:06 
Assiut sate 5°45 701 4:02 
Assuan ... Are well 4°20 7°68 0°82 
Beni Suef Dial 8:42 2°03 
Fayoum ... 6°85 6°54 1°85 
Gerga 3°95 5°84 2°22 
Guizeh A 5:10 6°38 1:15 

|} Kena... Re ie 3°45 5-34 1°66 

| Minia 5:20 (es Sule 


the male population*, aad the number of male scholars aged 5 to 19 per 100 of 
the native boys of those agest. 


We shall use the following symbols to denote the factors which occur in 
Tables I and IT: 
O = Ottomans, G=Greeks, J = Italians, 
= Europeans other than Greeks and Italians. 
C= Criminality, 2 = Literacy, S=Scholarship, D = Density of Population. 
Each government was treated as of equal weight, although the populations 
vary from 233,000 in Assuan to 1,485,000 in Gharbieh. The standard-deviations 
and product-moments were found without grouping. The following results were 
obtained : 


Means Standard Deviations Correlations 
No = TO15, el 4-791, Noh + 8450) aE ‘0468, 
1 — WOO, op = CoAl: Tos = + 6242 + 0999, 
mg = 5031, og = 7'735, rrs = +9028 + 0308, 
Mp = 1528, op = 2475°5, 
Correlations : 


rpc = +9614 + 0124, rps= + °8097 +:0566, = rp, = +9563 + 0138. 
Now at first sight these results would seem to indicate a very bad influence of 
education on crime. Where literacy and scholarship are greatest, there criminals 


* Kgyptian Census, 1907, p. 99. 

+ Foreign male scholars are excluded in the case of Cairo, Alexandria and the Canal. They have 
no sensible numerical existence elsewhere. Criminals and scholars are taken from the Annuaire Statis- 
tique de VEgypte, 1912, pp. 95 and 135. 


M. Hosny 283 


are most numerous! And a superficial argument might be used to condemn the 
character of education in Egypt, or education in general. But it will be clear 
on examination of the isolated values that the observed high correlations arise 
solely from the urban character in Egypt of both criminality and education. We 
have endeavoured therefore to correct this by finding the partial correlations for 
constant density of population. 


There now result 

prog = — 9554 + 0143, 

pcr = — 9231 + 0242, 

blips = +7480 +0721. 
Thus, while there still remains a quite considerable relation between the prevalence 
of literacy and scholars for constant density, we find that for a constant degree of 
urban conditions, the greater the literacy and the greater the amount of education 
the less will be the criminality. The negative correlations are now even higher 
than the uncorrected positive ones and of course are markedly significant. While 
admitting the slender nature of the Egyptian data, we think that this swinging 
over of the relation of crime and education when we correct for density is sug- 
gestive, and it would be of interest to work out similar correlations for states 
in which the statistics are of a more ample character. It does, however, appear 
reasonable to assert that there is no evidence to indicate that education leads 
to criminality—rather the reverse—in Egypt. 


We will next consider the influence of the presence of foreigners in Egypt. 
We find: 


Means Standard Deviations Correlations 
Mo= 99°53, oo — 195-60; Too = +°8425 +0478, 
Mp= 86°88, Bea lleeyidy rao = +°9546 £0145, 
Mg = 119-41, og = 25661, roo= +9429 +:0181, 
m= 68°24, oa, = 152-04, rio= +9192 + 0254. 
Correlations : 
po = +°9575 +0136, rpm = +:9844 + 0050, 
Ppa = +9491 + 0162, rpr= + ‘9617 + 0123. 


Here, if we judged by the raw correlations only, we must assert that the corre- 
lations of crime with the presence of foreigners are so high, that the foreigners 
must be corrupting the Egyptian population. But again the association only 
arises because the criminals and foreigners are both prevalent in the big towns. 
If we correct for density of population, we find the results are very different. Thus 


we have: 
broad >= — ‘9811 +0061, preg=t+ "1692 ap ‘1591, 
p’ac = +°3524 + °1483, pro = — 0718 + 11628. 


It is now obvious that the correlation of Europeans other than Greeks and 
Italians with criminality has become insignificant having regard to its probable 
error; the correlation of the presence of Italians and criminality is now negative, 
but less than its probable error. Thus of Christians only the presence of the 


284 Statistical Notes on the Influence of Education in Egypt 


Greeks may possibly, but not certainly, be detrimental. The Ottomans have 
now a large negative correlation of a quite significant character, or we might 
assert that the presence of Ottomans tends to diminish criminality. The Greeks 
are frequently moneylenders and alcohol dealers, and the Ottomans, especially 
the Arabs, have among them a good many religious teachers. 


We have, however, to note that criminality is greatest in the Canal Govern- . 
ment, where Europeans and Greeks are most frequent, while the Ottomans are 
most numerous in Alexandria, where crime is almost 40% less than in the Canal 
Government. To test the influence of the three densely populated governments, 
we put the Canal proportion of the Ottomans at Cairo, that of Cairo at Alexandria 
and that of Alexandria at the Canal. There resulted: 

Toc = +°9707, instead of +°8425, 
Tpo = +'9870, instead of + °9575, 

leading to pYoc= + °4918, 

or we may safely say, that if the proportions of Ottomans at Alexandria and along 
the Canal were interchanged, then no relation between the presence of Ottomans 
and the absence of criminality would exist, indeed the relation would probably be 
reversed. The prevalence of the Ottomans in Alexandria has been attributed to its 
more temperate climate. There is certainly a large Ottoman element in Alexandria, 
there being 21,827 Ottomans out of a population of 332,246, and it is larger 
than any other foreign element except the Greeks. In Cairo, with 29,516 out 
of 654,476 inhabitants, the Ottomans exceed any other single foreign element. 
It is conceivable, therefore, that they may be able to influence the moral tone of 
those towns. It must be borne in mind, however, that crime is far more frequent 
in the Cairo and Alexandria governments than in the more purely rural districts, 
and we can scarcely suppose that Cairo and Alexandria would reach the still higher 
criminality level of the Canal, were it not for the presence of the Ottomans. In 
the Canal Government there are exceptional conditions, and we can hardly assume 
that a transfer of the Ottomans from Alexandria to the Canal would interchange 
their proportions of criminality. Greeks no doubt flock to the Canal for business 
purposes, the other Europeans largely for control purposes; the Ottomans, 
relatively speaking, avoid it. Without further analysis it would not be possible 
to assert definitely that the presence of Ottomans reduces crime. It may be 
doubted whether the presence of foreigners, with the possible exception of the 
Greeks, is really associated with the extent of criminality in Egypt. 

A further investigation was undertaken in regard to the possible influence 
of education on infantile mortality. The birthrate and deathrate in Egypt are 
both remarkably high. Thus for the years 1899-1909 inclusive the average 


rates were: 
Births per 1000* Deaths per 1000 Excess of Births over Deaths 


Cairo 40°7 35°7 +50 
Alexandria 38:0 31°8 + 62 


* Still births not included. 


M. Hosny 285 


Many European towns with half the above birthrates have considerably greater 
excesses of births over deaths. 


Unfortunately the infantile mortality is only recorded in Egyptian towns and 
not in the governments at large or in the rural districts. We are obliged, there- 
fore, to deal with these only when considering the relation of education to infantile 
mortality. From p. 49 of the Annuaire Statistique de V Egypte, 1912, we obtain 
the infantile mortality for 1911, and note at once how extraordinarily high it 
stands. From p. 286 of the Census of Egypt, 1907, we take the percentage of 
male literates in the total male population, and from the Statistique Scolaire, 
1912-1913, p. 74, the percentage of scholars in the total population*. As it 
was possible that the density of town population might influence the results, we 
took the number of persons per house, which was about the only social factor 
available. . This will probably represent fairly closely the average size of family. 
This was taken from the Census, 1907, p. 286. The mean, 5°64 persons per 
house, suggests that the average number of living children can hardly exceed 
three. The marked relationship that occurs in European towns between gross size 
of family and infantile mortality cannot be satisfactorily tested on the Egyptian 
data, because we cannot ascertain the infantile mortality in each size of family. 
The number of persons to the house is indeed rather a measure of net than 
gross family, and we only know this as an average value for each town. It does 
not follow that a town with a low number of persons per house is one with 
small gross families; the low number may be due to the heavy infantile mortality 
itself. Accordingly the correlation between persons per house and infantile 
mortality is not necessarily even a measure of the influence of overcrowding on 
infantile mortality (although this is often supposed to be the case); it is con- 
ceivable that a high infantile mortality might be the source of a low number of 
persons per house, and the unravelling of cause and effect is only possible 
where we know not only the number of persons per house, but its relation to 
both the gross and net family of that house. 


Let J = Infantile mortality, Z = Literacy, S = Scholars, P = Persons per house. 
Then we have the following results : 


Means Standard Deviations Correlations 
M, = 29°30, o, = 7608, ry, = —'1040 + 1500, 
M,, = 21:96, 7 ouligue Trg = +5809 + 1278, 
M,= 5°569, og = 2951, ris = +0093 + 1508, 
Mp= 5°643, op = 1428, oe 


Correlations : 
rprp= + 1675 +°1487, rp, =— "842141487, rpg = +0296 4 °1508. 


* The scholars were taken for 1910-1911, the year of infantile mortality, but this involved 
the assumption that the foreign scholars were the same in numbers in 1910-11 and 1912-13, probably 
not a very inaccurate assumption, which in any case affects little more than Cairo, Alexandria, Suez 
and Ismailia practically. It is the number of Egyptian scholars that is rapidly changing and the 
scholars dealt with in our ratio are Egyptian only. 


Biometrika x 37 


286 Statistical Notes on the Influence of Education in Egypt 


TABLE IIL. 


Infantile Mortality. Persons per House and Education. 


Infantile Male Literacy ‘ cholars Persons 

Town Mortality per 100 of per 100 of per 

per 100 births population population House 

Cairo ee. 32°9 28°03 6°12 4°62 
Alexandria Sec 26°9 30°09 3°41 8°43 
Damietta 18°1 7°06 1°89 6°96 
Port Said ae 21:0 24°13 2°64 4°14 
Ismailia ... 26 16:0 28°05 1:22 4:10 
Suez oe See 26°9 25°74 0°64 3°85 
Benha ... sre 29°6 20°54 4°87 5:47 
Zagazig ... we 27°9 25°67 6°47 6:07 
Tantah ... ae 29°6 25°62 9°45 5°18 

| Mansorah sae 21°4 26°77 6°20 4°60 
| Chibine El] Kom 16°1 18°71 5:10 4°92 
Damanhur ee Oieo 19°54 3°55 7:27 

| Guizeh ... a 35°6 19°23 eS 6°12 
| Fayoum ... fee 40°1 15°55 4°78 9°34 
Beni Suef Sur i7/ 2il 21°15 7:19 511 

| Minia... ae 38°2 21°96 8°89 4:96 
Assiut ... ist 33°6 20:92 11°48 6:00 
Sohag... ee 29°0 DOS | 5°58 6°15 
Kena see oe oe 16°76 4°80 5:14 
Assuan ... ae 41°1 21°31 5°78 4:09 

hi 1 


It will be clear from these results that there is no significant relation between the 
literacy of the male population and infantile mortality. There is also no significant 
relation between the number of persons to a house and the number of scholars, 
Le. it does not appear to be the more crowded towns which have the largest 
percentage of scholars to the population ; Alexandria and Damietta, for example, 
have considerably more than the mean number of persons to the house and 
relatively few scholars. On the other hand a larger number of literates marks 
less crowding. Crowding and infantile mortality are slightly related, but con- 
sidering the probable error, not with definite significance *. 


While literacy has no relation to the infantile deathrate, it is noteworthy 
that there is a significant correlation (+°53809 +°1278) between the number of 
scholars and the infantile deathrate, which is greater where there 1s more education. 
Now this either suggests that many scholars mean large families and large 
families correspond to increased infantile mortality, which is usual, or that the 
towns in which there are the classes who educate their children have a higher 
infantile deathrate. The only means, and those inadequate, of testing the first 


* This agrees with the result for overcrowding and infantile mortality in English manufacturing 
towns, where the correlation is very small and sometimes has one sign and sometimes the other. 


M. Hosny 287 


assumption are to take the partial coefficient between scholars and deathrate for 
constant number of persons per house. 


We find*: pris => + 5336 + 1107 5 
similarly pry, = — 0504 + 1543. 


There is thus a slight increase in the relation of scholars to deathrate when 
we take a constant number of persons per house, and it is hard to believe that 
the relation is indirectly due to size of family. The second result shows that 
literacy has no relation to the infantile deathrate. Towns like Alexandria, 
Damietta, Port Said, Ismailia, and to a less extent Suez, with a low infantile 
deathrate have a low education rate, and towns like Cairo, Guizeh, Beni Suef, 
Minia, and Assiut, with high infantile deathrates have high education rates. The 
first towns are on the sea or the canal, the second in the Nile Valley; it is con- 
ceivable that the latter are the more unhealthy for the infant; it would need 
special local knowledge to explain why education has been most accepted above 
Cairot. There does not, however, seem any relation between ignorance, as 
measured by literacy, and a heavy infantile mortality, nor on the other hand can we 
assert that education and European influence have certainly increased criminality. 


* The value of p7,5 is + °0207 +1508, and is therefore not significant. 

+ It is noteworthy that there is no relation between literacy and number of scholars, i.e. education 
of children does not appear to follow the power to read and write in their parents, that is to say 
if we judge by the averages in towns and not by individuals. 


37—2 


HEIGHT AND WEIGHT OF SCHOOL CHILDREN 
IN GLASGOW 


By ETHEL M. ELDERTON, Galton Fellow, University of London. 


In 1905-6 an enquiry was made in the Public Schools of the School Board for 
Glasgow as to the height and weight of all the scholars, the occupation of the 
parents, the number of rooms occupied etc. By permission of Sir John Struthers, 
of the Scottish Education Office, these schedules were most kindly placed at the 
disposal of the Galton Laboratory. 


The number of children concerning whom the enquiry was made is over 
seventy thousand of ages 5 to 18 years. The schools from which these children 
came were divided into four groups according to the district in which the schools 
were situated. 


Group A comprised schools in the poorest districts of the city. 


ee a » in poor districts of the city. 
ee. ‘i , in districts of a better class. 
eed 3 » in districts of a still higher class with which are 


included four out of five Higher Grade Schools. 


The data were originally used by the Galton Laboratory with the object of 
discovering how far the physique of school children, judged by their height and 
weight, is affected by the occupation of the father and the employment of the 
mother. With this end in view the necessary data were entered on cards. 
Children over 14 were excluded and all children who had not both parents alive 
were also excluded; this left us with 30,965 girls and 32,811 boys. 


The object of the present paper is to ascertain what is the average weight of 
a child of a given age and a given height. 


The first step in this enquiry was to sort the cards and form tables giving the 
distribution of weight for each height at each age in each school group, and this 
laborious work was carried out very largely by Miss Augusta Jones; this step 
necessitated making 72 tables, and she is responsible for 58 of them while the 


EK. M. EvprErtTon 289 


remaining 14 are due to Miss H. Gertrude Jones; I have to thank most heartily 
these colleagues for their very efficient help in this matter. 


The three factors with which we are concerned are the age, height, and weight 
of school children. The instructions issued to the teachers in the schools for 
recording these three facts were that ages were to be given to the nearest year, 
weights to the nearest pound, and heights to the nearest quarter of an inch*. The 
method of recording ages is very important. Ages being recorded to the nearest 
year, this means that children classed as 6 years were from 5°5 years to 6°5 years ; 
and the average age of this group was 6 years; this is not the method most 
frequently employed for recording ages; “age last birthday” is generally used 
and if “age last birthday” be given as 6 years then the children of that age are 
from 6 to 7 and the average age of children in this group is approximately 6°5 years. 
It will be seen at once that a comparison of weights and heights of two groups of 
children of 6 years cannot be undertaken until we know which method of recording 
ages has been adopted. The height and weight of these Glasgow children have been 
compared by Dr Leslie Mackenzie and Captain Foster+ with the height and weight 
of children as given by the Anthropometric Committee of the British Association 
and it is pointed out that at each age the average weight of the children is 
uniformly below the “standard of the Anthropometric Committee,” and that 
generally speaking the same thing applies to height. As a matter of fact this 
point as to age has not been noticed by these writers and children whose average 
age is 6 years in Glasgow are compared with children whose average age is 
65 years, naturally the younger children are shorter and lighter. There is further 
an important question to be asked: Which standard of the B.A. Anthropometric 
Committee ought to be selected? To this point I return below. 


As I have said the Glasgow children’s ages were recorded to the nearest year}, 
but the Anthropometric Committee recorded age last birthday, and before these 
children can be compared the six months extra growth must be allowed for. This 
is quite easily done by finding the regression of height and weight on age and 
adding half the regression coefficient to the height and weight of the Glasgow 
children. We have found the regression for children of 5 to 14 inclusive to be as 
follows: 


Boys Girls 
Regression of Weight on Age ... 4564 4916 
5 Height on Age foe 1:807 1937 


* Tt is not known what record was made when an exact half year, an exact half pound or an exact 
quarter inch occurred. 

+ Report on the Physical Condition of Children attending the Public Schools of the School Board for 
Glasgow, by Dr W. Leslie Mackenzie and Captain A. Foster. Wyman and Sons, 1907. 

+ The actual wording of the Glasgow direction to school teachers runs: “In recording age, disregard 
months and record to nearest year; thus 6 years 7 months record as 7 years, 8 years 3 months 
record as 8 years.” It is not clear how 6 years 6 months would be recorded; we have assumed as no half 
years are entered in the schedules that an exact calculation was made in the case of each child of 
doubtful age to ascertain whether it was or was not past the half year. 


290 Height and Weight of School Children in Glasgow 


This means that we must add 2°28 lbs. to the weight of the Glasgow boys and 
‘90 inches to their height and 2°5 lbs. to the weight of Glasgow girls and ‘97 inches 
to their height before we can compare them with the Anthropometric Committee’s 
standard. The Glasgow children still fall below the “ Anthropometric Committee’s 
Average” but not to the appalling extent shown in the diagram at the end of 
Dr Mackenzie’s and Captain Foster’s Report. Personally I should hesitate to 
compare actual height and weight of Glasgow school children with the so-called 
Anthropometric Committee’s standard. The so-called standard is taken from the 
Final Report of the Anthropometric Committee of the British Association, 1883. 
In Tables XVI—XIX, the average heights and weights at different ages of males 
and females of different classes of the population of Great Britain are given. For 
example in the case of stature we have four classes: Class I, Professional Classes, 
Town and Country, 10,739 individuals, ages 9 to 60; Class II, Commercial Classes, 
Towns, 5472 individuals, ages 8 to 60 (5 below 8 are of no service for means); 
Class III, Labouring Classes, Country, 8727 individuals, ages 3 to 70 (8 below 8 
are of no service); Class 1V, Artizans, Towns, 126,236 individuals, ages 3 to 60, and 
451 babies at birth. All these data are pooled and the column headed “ General 
Population, All Classes, Town and Country,” and it is this “General Population ” 
which is so frequently cited by various medical authorities, including Dr Leslie 
Mackenzie and Captain Foster, as the Anthropometric Committee's “standard.” 
What they understand by such a “standard” it 1s impossible to say. It does not 
represent the “General Population” of Great Britain, but the total population 
measured by the Committee. In this all the babies are artizan babies, there 
are only 8 children from 0 to 2 and these belong to the labouring rural classes, and 
there is no professional class contribution until after 9 years of age. Then the 
various age groups are made up from various social classes in proportions which 
bear no relation whatever to their actual proportions in the kingdom at large. For 
example, the average height of lads of 18 is determined from 1724 of the pro- 
fessional, 62 of the commercial, 148 of the rural labourer, and 371 of the town 
artizan classes! It will be quite clear that a “standard” reached in this way 
means absolutely nothing at all, and yet this is the “standard” which, attached to 
numerous weighing machines is posted in innumerable public places up and down 
this country. It does not in the least represent any “General Population” of 
Great Britain. To be a standard of the general population each class should have 
been properly weighted, and this cannot be done as in certain classes certain ages 
are quite inadequately represented, or not represented at all. There is in fact no 
such thing as an “ Anthropometric Committee’s standard” for either height or 
weight. The only thing that is possible is to compare the corresponding social 
class in that Committee’s measurements with the measurements under considera- 
tion. In the case of Dr Leslie Mackenzie’s and Captain Foster’s data, this is 
undoubtedly the Class IV, “ Artizans, Towns.” Such a comparison is made in the 
accompanying diagrams. It will be seen that the Glasgow children as far as 
height is concerned are the equals if not the superiors of the Anthropometric 


EK. M. ELpERTON 291 


Committee’s artizan class. In weight they appear to be somewhat less, but here 
Dr Leslie Mackenzie and Captain Foster have overlooked the fact that the Glasgow 
children were weighed without boots, but the British Association Committee weighed 
in ordinary indoor clothing, i.e. with boots or shoes on. Now girls’ boots weigh 
as much as 11 to 21 lbs. and boys’ boots 1? to 34 lbs.* Hence in comparing 
children in Glasgow with those six months older, Dr Leslie Mackenzie and Captain 
Foster have dropped 21 lbs. in weight, while in comparing children without boots with 
those with boots they have dropped another 14 lbs. to possibly 3$ lbs. We should 
anticipate therefore that their readings would be 3} to nearly 6 lbs. too small f. 
There is in our mind very little doubt that the weight of the Glasgow children is 
at every age equal or superior to the weight of the artizan children measured 
by the British Association Anthropometric Committee and the statement of 
Dr Leslie Mackenzie and Captain Foster that “at each age from 5 to 18 the average 
weight of the [Glasgow] children is uniformly below the standard of the Anthro- 
pometrical Committee{” arises from their having entirely overlooked the con- 
ditions as to class, age and manner of weighing which were adopted by that 
Committee, a knowledge of which was essential to any comparison with the Com- 
mittee’s data. In the diagrams on pp. 292-3 we have given the Glasgow measure- 
ments set against those of the artizan class of the Anthropometric Committee, 
and the reader will see clearly how all the arguments based on differences between 
the Glasgow and the “ Anthropometric standard” fall at once to the ground. 
There is nothing exceptional in the Glasgow data, they differ of course from data 
for the children of the professional classes, but this difference is not confined to 
Glasgow. Apart from this point it is essential that the ages of the two groups of 
children should be the same and not differ by six months. 


In the data used for this paper, children of 5 were omitted; they are few in 
number and are not therefore likely to give such reliable results when each age 
group is used separately. The mean weight for each height in inches was then 
found and the regression equation calculated. These equations are given in 
Table I. It will be observed from these equations that, though some irregularities 
occur, generally speaking weight increases more rapidly for a given height in the 
better school groups, at the later ages, and for girls more than boys except at 
ages 6 and 7. ; 

We can see from these equations that the multiple regression surface for 
weight on height and age is not absolutely planar. It can be shown that it is 


* New ‘‘tacket ” boots for girls of five in Glasgow weight 1 lb. 5 oz. falling to about 1 lb. 3 oz. 
when the tackets are worn down ; for girls of fourteen 2 lbs. 6 oz. falling to about 2 lbs. 2 oz. For boys 
of five years new tacket boots weigh 1 lb. 14 oz. falling to about 1 1b. 11 0z. when worn down; for boys 
of fourteen the former weigh 3 lbs. 9 oz. and the latter about 3 lbs. 30z. We have to thank Dr Chalmers, 
M.O.H. for.Glasgow, for this information. 

+ Many public elementary school children have great masses of metal on their boots. Undoubtedly 
the older children have heavier boots, and we can see from the diagrams that the divergence of the 
Glasgow children from the Anthropometric Committee’s artizan children increases with age. 

£ Report, Scottish Education Department, 1907, p. iv. 


292 Height and Weight of School Children in Glasgow 


| WRIGHT AND HEIGHT OF GIRLS 


90 Glasgow Data, All Schools. 
B.A. Anthropometric Committee, 
85 Artizan Class. 


65 


Weight in lds, 


> 
Ss) 


Ot 
Or 


Sera 
Ge Ve 
S) 


Height in inches. 


a eat nant tte 


E. M. ELDERTON 


9} WEIGHT AND HEIGHT OF BOYS. 


90 Glasgow Data, All Schools. 
B.A. Anthropometric Committee, 
Artizan Class. 


293 


oS! 


Or 
Or 


Weight in lbs. 


or 
oO 


Orv oO 
oO Fe 


HHIGHT. 


abe) 6 a 8 9 OMe = Ae Ss) U4, tS 


Biometrika x 38 


Height in inches. 


294 


TABLE I. 


Height and Weight of School Children in Glasgow 


Glasgow. 

| Age Group 4, Boys Group B, Boys Group C, Boys | Group D, Boys 
6 | W=—-21:16441:503H | W=—22°57641532H | W=— 24:4894+1591H | W=— 25°6784+ 1-603 
7 | W=—-21-0654+1519H | W= —26:308+1°635H | W=— 27°39041667H | W=— 34:819+1°818H 
8) W=-19°38141:-495H | W=-28:127+1695H |W=—- 6°64041:227H |W=—- 3277741 790H 
| 9| W=—24°65241:635H | W=—32°8264+1-818H4 | W=— 20°67141562H W=-— 36:030+1:883H 
10 | W= —29°589+41-768H | W=—35°696+1:899H | W=— 42°94242°055H | W=—- 51:63642:218H 
| 11 | W=—54:91042302H | W=—39°72542-:005H | W=— 43-628+2-:088H | W=— 67°66442°546H 
| 12) W=—49°5134+2:217H | W= —62:8904+2-476H | W=— 55:°386+42°337H | W=— 62:052+2-450H 
| 13 | W=—65:467+2°547H | W=—-63°51642511H | W=— 81:342+2°854H | W=—- 99:908+43°160H 
14 | W= —83°7844+2°888H | W=—76°749+2775H | W= —103°661+3°251H | W=— 126°446 + 3°633H 

| 

Age Group 4, Girls | Group B, Girls Group C, Girls Group D, Girls 
6 | W=- 14561413298 | W=—-15:98541:345H | W=— 23°72841:551H | W=—- 27°5634+1624H 
7 | W=-19°762+1465H | W= —24:13041:556H | W=— 29°312+1693H | W=— 30°22641694H 
8 | W=—20-7214150383H | W= —30°6224+1-718H | W=— 29°113+1:691H | W=— 40°156 +1:927 H 
9| W=-30°1334+1:730H | W= —29-0814+1°709H | W=— 38°62441917H | W=— 45:0664+2-045H 
10 | W=—36-4784+1:878H | W=—38:8664+1:925H | W=— 46-263+2-088H | W=- 62:015+2397H 
11 | W=—-48-70742:153H | W=- 43:005+2:034H | W=- 51:146+2209H | W=— 57°754+2'330H 
12 |, W= —58°2774+2°360H | W= —63:908+2°465H | W=— -77°316+2°735H | W=— 84:298+42°8597 
13 | W=—%74:15642694H | W=-—88-043+2:939H | W=— 83:960+2:892H | W=—103°594+3:229H 
14 | W=-95:4644+3:084H W=-84:49642:906H | W= —106:1334+3°317H | W= —134:197+3'804H 

; | 


W is weight in lbs., H is height in inches. 
The ages are central ages, and to obtain the weight corresponding 
the child should be taken to the nearest whole year. 


weight on age and not height on age which is non-linear. 


to a given height 


The departure from 


linearity is not great, but Mr H. E. Soper, in order to smooth the material, fitted a 
parabolic surface to the regression surface of weight on height and age. 


Let W be as before the weight in lbs., H =height in inches, and y equal the 


age of the child measured from 10*. 


linear. 


Then 


W=—¢.(y) + ¢o(y) H 


is the form of the surface when the relation of W to H for a given age is sensibly 


Mr Soper now assumed: 


hi(Y= At nyt+t ay, do(y) =b + hy + boy? 


and determined a, a, and as, b), b; and b, so that: 


> {n (bi (y) — a — Hy + aoy”)} = minimum, 
> {n (db. (y) — b, — bry — b.y?)} = minimum, 


where » is the number of individuals in any age group. 


* Thus y takes every value from — 4 to +4 and we have nine equations to deal with. 


Biometrika, Vol. X, Parts I and III Plate XIX 


EPORT . 1906) 


(CLASS B, CLASCOW R 


. Model of Regression Surface, giving mean weight of Girls of Class B of Glasgow Schools for a given 
Height and Age. The mean weight is the vertical coordinate and each section parallel to the front 
of the model gives the mean weight for the several Heights of Girls of a given Age. See p. 295. 


E. M. ELpERTON 295 
Now ¢,(y) and ¢,(y) for given ’s are the values determined in Table I for 
the constants at each age of the regression lines 
W=-—-A+8H, 


and n is the number of children dealt with at each age. Our type equations are 


then of the form: 
Up + 4a, % (ny) + 4a.> (ny?) = 42 (A), 


hay’ (ny) + 4c, (ny?) + ae (ny!) = 43 (Ay) 
4a) & (ny?) + 44, (ny?) + ta. (ny) =1> (Ay), 
and similar equations for b,, b,, b., with B for A. 


When these constants had been determined we put Y=10+y and obtain the 
equations given in Table II. 


TABLE II. 
Glasgow. School Children. 


W= Weight in lbs. Hf =WHeight in inches. Y=True age. 
Boys 
Group A  W= {02181 242-214 —-2554Y} x H- (11327524 67°542-14-7417 F}, 
B W= {01533 Y2+1:900—-"1493V' x H— { 83314Y7+ 53:101- 9°8662Y}, 
C W= {03990 Y243-614— 5796 Y} x H - {2:08397 Y2+4 139-456 — 31-6570 F}, 
D W= {02983 Y?+2°799 — 3636} x H— {176624 Y24111-407 —24-0174Y?. 


” 


Girls 
Group A W= {01880 Y?4+1°657—1644V} x H— { :96454¥?+438:119— 9°7305 7}, 
B W= {02081 Y?+1:907 —-2043 Y} x H— {1:16330 V2 + 60-230 — 13°6925 7, 
4 C W= {02315 V242°222 —-2457YV} x H— {121642 VY? + 67626 — 14°3416)", 
D W= {02701 Y?+2:385 — -2832V} x H— {1:56165 VY? + 87-624 — 18°9239 Y}. 


A model of the surface for Glasgow Girls of Class B has been made by 
Mr Soper. Allowing for the points based on few observations at the end of each 
regression line of weight on height for constant age, the scroll represented by black 
threads is quite a good fit to the observations represented by card sections. The 
model is, however, difficult to photograph in a manner which shows effectively the 
approximation of the thread scroll to the cut card sections. The reader should 
note that an additional thread is placed between each of the threads which 
graduate the regression lines for the different ages. 


Further eight tables (Tables a—0) have been constructed in order that the 
average weight of any boy or girl of a given height and age can be read off at once. 
See pp. 8300—303. It has been stated before that the age groups in Glasgow are 
from 5°5 to 6°5 years etc.and that 6,7, 8 etc. are the centres of each age group, 

38—2 


296 Height and Weight of School Children in Glasgow 


but since frequently the centres are at 6°5, 7°5 etc. we have constructed Table III 
which enables anyone to find mean height and weight at any age between 5°5 and 
14°5 years. The regression lines are calculated from the original tables in which 
children of 45 to 5°5 years were included. The regression lines omitting the 


TABLE III. 
Glasgow. 
| Mean Height : Mean Weight 
y Ge es sl er D Ae a8 G D 
Boys: 5:°5— 6°5 41°3 42°1 42°] 43°0 40°9 42°0 42°'5 43°3 
6Ob— 7h 43°0 44°0 44:0 44°8 44°2 45°6 45-9 46°6 
T'5— 8:5 45°1 45:9 46°2 46°9 48-0 49°6 50°1 51°2 
8'h— 95 47°0 47°7 48°1 49°0 52°3 53°9 54°4 56°3 
9:5—10°5 48°8 49°5 49°9 50°9 56°7 58°4 59°5 61:2 
10°5—11°5 50°6 51°1 51°5 52°6 61°6 62°7 63°9 66°3 
11°5—12°5 52°3 52°8 53°5 54°2 66°4 67°8 69°1 70°8 
12°5—13°5 53°8 54°3 55°0 55°9 lier 72°9 75°6 76°9 
IZ*5—14'5 55°2 55'5 572 57°7 75°6 77°3 82°2 83°2 


Regression on Age | 1800 ins. | 1°728 ins. | 1°847 ins. | 1°846 ins. | 4°305 lbs. | 4°395 Ibs. | 4°772 Ibs. | 4°914 lbs. 


Girls : 


5'b— C5 41°0 42°0 41°9 42°7 39°9 40°6 41°3 
65— 75 42°9 43°7 43°7 44°8 43°0 43°9 44°7 
7'5— 8:5 44°6 45°6 45°6 46°4 46°4 47°7 48°1 
S:5— 95 46°6 47°4 47°6 | 48:6 50°5 51°8 52°7 
9'°5—-10°5 48°5 49°2 49°4 50°4 54:7 55'8 56°9 
10°5—11°5 50°3 51°1 51:2 52°2 59°5 60°8 61°9 
11°5—12°5 52°4 53°0 53°3 54°1 65°3 66°8 68°4 
12°5—13°5 54°4 55°2 55°4 56°5 72°4 74:3 761 
IS*5—14'5 55°8 57°71 57°0 58°7 76°8 81°3 83°0 


Regression on Age | 1°914 ins. | 1°859 ins. | 1°903 ins. | 1°943 ins. | 4°551 lbs. | 5-083 Ibs. | 4-944 Ibs. | 5°489 lbs. 


41°8 
45°6 
49°3 
54°3 
58°8 
64°4 
70°5 
78°8 
89-0 


children of 4°5 to 5°5 years were worked out for height on age and weight on age 
for boys in Group A, and were found to be 1°81 instead of 1°80 for height and 
439 instead of 4°31 for weight, but such differences are not great enough to 
matter and the remaining regression coefficients were not calculated with children 
of five years excluded. 


In connection with the tables (1 to 72, pp. 304—3839) it should be noted that in 
transferring the data for boys from the original sheets to cards, ‘75 of an inch was 
included in the inch above; for example, 30°75 inches was entered as 81 inches 
and the centre of the group of 30 inches is 30°125 inches. The data for the girls 


KE. M. ELpERTON 297 


were transferred to cards much later and the simpler method was employed, and 
30:75 was included in the 30 inch group and the centre of this group is 80°375. 


Through the kindness of Dr Priestley, School Medical Officer for Staffordshire, 
we have been able to obtain the regression of Weight on Height for certain 
age groups of boys and girls in that county. Staffordshire is a county of very 
various occupations and contains an agricultural as well as a mining and factory 
population. 


The children measured are “entrants” and “leavers” and a further group of 
children, namely those from 8 to 9 were measured. The “leavers” include 
children of 12 to 14 years, “since in general the only ‘leavers’ at age 12 to 15 
are rural, and the only ‘leavers’ at age 13 to 14 are urban*.” 


The children were of the age stated, 5 and not yet 6, 8 and not yet 9, on 
January 1, 1911, but the actual day of weighing may have been any school day from 
January to December, so that a child entered as 8 may have been only a few days 
short of 10 when it was actually measured, and therefore the mean age of the 
group of children of 8 to 9 will be 9 years. “In the case of the group of leavers, 18 
to 14, no child can have been more than 14, because on attaining that age the 
children are entitled to leave school, and generally do leave. With these the mean 
height and weight in our tables refer to the true mean of the years of the group, 
viz. 13 and a halft.” We shall table to the middle of the group, namely at ages 6, 
9, 138 and 134. 


The children were weighed and measured without shoes, but in ordinary indoor 
clothes. The figures were read to the nearest quarter of an inch and to the nearest 
quarter of a pound. 


Staffordshire Children. 


GIRLS Boys 
Ages | | | 
Regression of | Regression of 
Mean Mean Taio | Mean | Mean Poe 
Height | Weight | eee Height | Weight ane 
| 
| | 
6 419 | 398 | 1-705 49-1 | 41:0 1-741 
9 47°7 51:1 | 2024 48°1 | 53°0 2°120 
13 56°7 (ek | See, ays) | Thos} 2°811 
13} 57°1 81-0 3°360 56°3 | teat, 3°166 
| 


It will be as well to compare these means with those for all Glasgow ; so far we 
have not given them in this paper for all the schools taken together but only for 
each school group. 


* Staffordshire County Council, Annual Report of the School Medical Officer for the Year 1911. 
J. and C. Mort, Ltd., 39, Greengate Street, Stafford, 1912. 
+ Ibid. p. 25. 


298 Height and Weight of School Children in Glasgow 


Tlasgow Children. 


GIRLS Boys 
Ages 
Mean Mean Mean Mean 
Height Weight Height Weight 
6 41°7 40°5 41°9 41°8 
9 47°3 51°3 AlT(OT | 53°7 
Le Yar 74°8 54°6 | 73°6 


Girls in Staffordshire are taller at ages 6, 9, and 13 than girls in Glasgow, but 
they are lighter at ages 6 and 9. We might argue from this a lack of physique in 
Staffordshire girls who are absolutely ‘7 Ibs. lighter at age 6 than Glasgow children, 
and relatively to their height even more than this amount. At age 9 the absolute 
difference is less and at age 13 Staffordshire girls are heavier than Glasgow girls 
but they are 14 inches taller, and since the regression of weight on height at 
age 13 for girls is 3°272 lbs. we should expect Staffordshire girls to be 49 lbs. 
heavier than Glasgow girls, but they are not so much. I should hesitate to say 
that the physique of Staffordshire girls is inferior to that of Glasgow girls; the 
difference probably is one of race, but such questions must remain unsolved till we 
have a far wider range of anthropometric data than is available at present for all 
the districts of Great Britain. Boys show the same characteristics to a lesser 
extent; Staffordshire boys are taller at ages 6, 9, and 13, but they are lighter in 
weight; at age 6 they are ‘8 lbs. lighter than Glasgow boys; at age 9 they are ‘7 lbs. 
lighter and at age 13 they are 1:7 lbs. heavier, Again relative to their height 
Staffordshire boys are lighter than Glasgow boys at the three ages for which a 
comparison can be made. 


Comparing boys and girls in Staffordshire we find that girls of 6 and 9 are 
shorter and lighter than boys of the same age, but at 13 and 13% girls are both 
taller and heavier. At 6 and 9 years the regression of weight on height is 
practically the same for both sexes, but at 13 and 13} the regression of weight on 
height is greater for girls than for boys; girls are heavier proportionally to their 
height than boys are. For girls of 13 an additional ich in height should mean 
3°3 lbs. more weight while for boys the additional pounds expected are only 2°8, 
while for girls of 134 we expect 3:4 lbs. increase for every inch of growth and for 
boys 3:2 lbs. increase. A comparison of the regression coefficients with those given 
for Glasgow in Table I will show that the coefficient is higher in Staffordshire for 
children of 6 and boys of 9 than in any of the school groups in Glasgow. The 
regression coefficient found for girls of 9 and 13 in Staffordshire is practically 
identical with that found in Group D in Glasgow, and boys of 13 in Staffordshire 
would seem to be most like boys of Group Cin Glasgow. 


EK. M. ELpERtTon 299 


In a Drapers’ Company Research Memoir * recently published tables are given 
showing the height and weight of boys and girls of 12 to 13 years who were 
members of the Worcestershire public elementary schools. These tables are 
XLIV and LIX and will be found on pp. 100 and 107 of the work cited; we 
have calculated the mean heights and weights and the regression coefficient of 
weight on height as we have done for Glasgow and Staffordshire. The mean age 
of the group of children of 12 to 13 years is 12°5 years, so allowance must be 
made for the six months age difference in comparing with the Glasgow data. We 
have already given the mean heights and weights of Glasgow and Staffordshire 
boys and girls of age 13 so we will calculate what the height and weight of 
Worcestershire children at age 13 would be. An additional year makes a difference 
of roughly 1:9 inches and 49 lbs. in the height and weight of a girl and of 
1°S inches and 4:6 lbs. in the height and weight of a boy. 


GIRLS Boys 
, Regression ; | Regression 
een eee of Weight Ea ioa Gone of Weight 
eigh eig on Height eigh eig On EeiHe 
12°5) Eat Sen ee 55°2 72°9 2°829 54°6 eal 2°800 
emcee estes Fea | 75.8 = BBS | 744 em 
18 Staffordshire 56°7 780+ = 55°8 ToL3 = 
13 Glasgow 55°2 74:8 — 54°6 73°6 —- 


Worcestershire children are taller than Glasgow children but slightly shorter 
than Staffordshire children. They are also rather heavier than Glasgow children 
but not relatively to their height. The height of Worcestershire children of 
12°5 years is the same as the height of Glasgow children of 13 years, but the 
weight of girls is 2 lbs. less and of boys is 14 lbs. less. Worcestershire children 
are lighter than Staffordshire children, but when allowance is made for the 
difference in height the Worcestershire children are not much at a disadvantage ; 
girls are a pound lighter and the weight of boys is practically the same. 


The differences we have found between the Worcestershire, Staffordshire and 
Glasgow children may well be due to differences of local race, and not be the 
results of differential environment or nurture. We should have little hesitation in 
applying the returns for Glasgow children as an approximate standard—say to the 
lb. and inch—for all British children of the artizan classes. 


* «A Statistical Study of Oral Temperatures in School Children with special reference to Parental 
Environment and Class Differences,” by M. H. Williams, Julia Bell and Karl Pearson. Studies in 
National Deterioration, IX. 1914. Dulau and Co., Ltd., 37, Soho Square, W. 

+ This weight appears somewhat exaggerated. It may in part be due to local differences in the 
average ages of ‘leavers.’ 


300 


Height and Weight of School Children in Glasgow 


TABLE a GLASGOW. BOYS. GROUP A*. 
Weights for Height at each Age. 
Actual Age. 
6 7 8 9 10 11 12 13 14 
Oo gees.5 = =| ea = — -- _ — 
34 | 30°0 | 31-0 — | = — — — — = 
86 Weslo i325 = == — = — — — 
Ae | SERB oe esi ip) — — — — 
oy, 34:4 35:4) |) = 35:8a — | = — —- | — 
38 35:8 36°9 | 37:4 3iE3 ne — — |) = 
39 37°3 38-4 39°0 | 39:0 38-4 | — —_ — — 
jo | 388 | 39°99 | 40° | 406 | 40:2 | 39:3 = — = 
Bt | 40:3) |) 14 498] AD 30 249 es eles = = = 
y2 40°7 | 499. | 43°71 “4ahOs |ex43-08 | BaB:4ye) w49-a = — 
48 | 43°2 | 44-4 | 45:2 | AB 7 4537 9) 45-4 |) 44-7 AO Gum 
44 | 44:6 | 45°99 | 46°83 | 47-49) 47-6" || 47-5 |) 47-0) 46-2 eee 
S| 258) 46s Ay a de Ae eo 494 | 49° | 49:3 | 48-7 | 47-9 
"oo | 46 | 47:6 | 48-90) 49:9: | 50-7 =| 5IS3: | 51-5) || (51-6) || IcSenenoee 
"©.| 4% | 491 | 50-4 |" 61-5 9) 5974 5351 11 653-6" | 538) | 53 :omen aa 
| 48 | 50S | 51:9 | 53-1 | 54: 54-9.) 556 | 56-1) 756 Deamenoee 
49 _ 53-4 | 54°6. | 55°83 |. 56:8 | 57-7 | 58:4 | 59-1 | 59°6 
50 -- 54°9 56°2 57:5 || 58:6 59°7 60°7 61°6 62°5 
51 = = 57°8 | 59-2 | 60:5 | 61:8 | 63:0 || 164-2, Senet 
52 = = 59°38 _| 60-8 | 62:3 |-63°8 | 65°3° || 66:8. j\m68:3 
53 dn a 60:9 | 62:5 | 64:2 | 65:8 | 67°6. | 69:40") sD 
54 | 64:2 | 66:0 | 67:99 | 69°9 | 72:0 | 74-1 
55 | 65:9 67°8 69°9 72°2 74°5 77°0 
| 86 | 67°6 69°7 72-0 745 77:1 79°9 
eae: 69:2 | 71:5: || 74:0 —| 716-7 | 70-7 aalesono 
eta te 2 alee tales 734 | 761 | 79:0 | 82-3 | 85-8 
59 fy aisle 81°3 84°9 88°7 
60 ae a 83:6 | 87-4 | 916 
61 : 90°0 | 94°5 
put = | : ae ae = 97-4 
TABLE 6. GLASGOW. BOYS. GROUP B. 
Weights for Height at each Age. 
Actual Age. 
| asta 7 8 9 10 u 12 13 U 
| 
[ego Sle 7c5 — | — = = = — = — 
34 | 29:0 — | — = = = — — — 
Sh. Nis80-C@ie ola pa ae = = — — = 
cies epi | Soni Wy: as — _ — 
73307 ile sasGe | reso = — — 
SBN) 8572) 13620 36.6 = — — — — — 
39. | "36°8 Nav:8, noes a a = — = = 
40 | 383 | 39°4-}-40:0 | 4071 | 39°8 — — — _ 
41 | 39-9 “| 40:0.) 47a S41) eae — — — — 
42 | -A0-5: |) 42:6) 4) AB *Ses | A387 eo 8 = —_ — 
8 | 43-0 | 44-9 4570 1 45-5 45-7 45-5 — — — 
yb | +446) | 45:8 9 467) 47 -Sie4 76am 47-6 — — — 
a | ae | 465 1 474 4849 40-159) 49-5) 40-7 eoes = = 
| 46) 47:7. | 49:02) 5051) 50297, 51-5) eS ma eee sel MD IEG — 
20) 47 | 49-2 | 50-6) | 51:8 | 52-7 | 253-40 653-0 erode | ede? cal serorad, 
| 48 | 508 | 522 | 53:5 | 54:5 | 55-4 | 560 | 565 | 567 | 568 
49 | 82:4 | 53:8 | 55-2 |) 56:3 | 5763 9) 87s | 68'S | 59:3 ailoore 
50 | 53°9 | 55:4 |.-56°8 -| 581 | 59:29) 60:2 ss6l-1 | Ci-8 mao 
51 — 57-0. | 58'5 | 59:9 || 61-20) (62745 63:4) |) 64-4en Ga 
52 a 58°7 |) 6020 Glave ose 645 | 65:7 | 66:9 | 68-1 
58 — = 61:9 | 63:5 | 651 | 66°6 | 68-1 | 69:5 | 70:9 
54 65°3 67:0 68°7 70-4 72°0 73°7 
55 6771) | G8r9= | 70:8. 7277 || e746 vers 
56 — = — —- 70°9 72:9 75°0 dell 79°3 
57 — = — -— 72°8 75°0 77°3 om 82:1 
58 Tel) 79261), (82520 ee beO 
59 a = | 79°3 | 81:9 | 84:8 | 87°8 
60 — = — | — —- — 84°3 87°3 90°6 
61 | = — 89°9 | 93-4 
62 | — — -- — 92°4 96°2 
63 | — —- 95°0 99°0 


* Throughout weights are given in lbs., 


heights in inches. 


E. M. Evprerton 301 


TABLE y. GLASGOW. BOYS. GROUP 0. 
Weights for Height at each Age. 
Actual Age. 


eee 
6 Ge 8 9 10 ol 12 13 10} 
35 30°5 as = =— = = — a — 
36 SOE al ae = = = = = — 
37 33°7 36-0 — = = — — — — 
38 35:2 37°5 38-6 = = — — — a 
39 36°8 39-0 40°1 = = = = —~ = 
40 38-4 40°5 41°7 41°8 = = = = = 
41 40-0 42-0 43-2 43°5 = == = = = 
42 ALD 43°5 44°7 45‘1 44-7 = = = = 
43 43:1 45-0 46°3 46°7 46°5 = = = = 
th 44:7 46°6 47°8 484 48:3 47°5 ae = = 
45 46:2 48'1 49°3 50:0 501 49°6 48°5 = = 
4G 47'8 49-6 50°8 516 51:9 51°7 50:9 = — 
Wee 49-4 51-1 52°4 53-2 53-7 53°7 53°3 = = 
We 3 51:0 52°6 53-9 54:9 55-5 55°8 55:7 55-4 = 
2) _— 54:1 55-4 56°5 57°3 579 58-2 58-2 = 
"en | 50 _ 55‘6 57:0 58:1 | 59-1 59°9 60°6 51-0 as 
®| 51 ere ae 58°5 59:8 60:9 62:0 63-0 63°8 64:6 
a) 52 tes yes 60:0 | 61:4 | 62:7 | 641 | 65-4 | 66-7 | 67-9 
53 as = 61-6 63:0 64:5 66°1 67'8 69°5 71:2 
54 as = 63'1 64:7 66-4 68-2 70:2 72°3 74°6 
55 = aah == 66°3 68-2 70°3 72°6 75°1 77-9 
56 = os = ae 70-0 72°3 75-0 77-9 81:2 
5Y = a 74:4 77°4 80°8 84°5 
58 =e ae = aa a 76°5 79°8 83°6 87°8 
59 ane ii ese z 82-2 86:4 91:2 
60 a a8 846 89-2 94°5 
61 el rane = at 92+1 97°8 
62 as aes es diss 94:9 | 101°1 
63 = Se a = 97°7 | 104°4 
64 aos ae = = Zs Js = — | 10738 
65 as = z = <e = as = 111-1 
66 ate = A — | 114-4 
67 —* aE yey 
TABLE 6. GLASGOW. BOYS. GROUP D. 
Weights for Height at each Age. 
Actual Age. 
6 oy, 8 9 10 aD 12 13 Lh 
37 31°7 = = me: = ae = = 3 
38 33-4 =a aa we = me = Z te 
89 | 35'1 37°1 = _ = — — — — | 
40 36°7 38°8 39°6 = re: = = — — 
41 38-4 40°5 41:4 aa = 2 = — — 
42 40°1 422 43-2 43°3 =e = = — _ 
43 41°8 43-9 45-0 A Ole _ = — — 
dA 43°5 45°6 46°8 47°1 46°5 = =e — — 
45 | 452 | 47:3 | 486 | 491 | 48-7 = _ — = 
4b 46°9 49°1 50-4 51:0 | 50°8 49°8 = — — 
Ay 48°6 50°8 52-2 53-0 53-0 52-3 50:8 — — 
48 50°3 52-5 54:0 54:9 55'1 54:7 53-5 = a 
19 ae 542 55'8 56:9 57:3 57'1 56:3 54-9 = 
| 40 = 55:9 576 58'8 59°4 59°5 59-0 58-0 = 
Bo | ol a ze 59-4 60°7 61°6 61:9 61°7 61°1 59:9 
Ra D2. — =. 61-2 62°7 63°7 64:3 64:5 64:2 63°5 
| 53 = ae = 64°6 65'8 66°7 67:2 67°3 67:0 
54 66°6 68°0 69°] 69°9 70°4 70°6 
OMe es — SMES OR Me pOg emi. | WG. || o78:5° |) 97422 
56 va oe = a 72°3 73:9 75°4 76°6 Wiel 
5Y s = —_ cy ae 76:3 78°1 79°8 81:3 
58 a _ 78°7 80°8 82-9 84:8 
59 as =e uae = =o 81:2 83°6 86-0 88-4 
60 ets pla ms 86°3 89°1 91:9 
61 os ae ie ans aa = 89-0 92-2 95°5 
62 a = == Lee aes = — 95°3 99-0 
63 we a es rs = = fs 98:4 | 102°6 
64 an = = a ii Sy a == 106-1 
65 aa ae oo as = ze = te 109°7 
66 ba) oes — mY a = i _ 113°3 
67 23 = ct as he an aa = 116°8 


Biometrika x 89 


302 Height and Weight of School Children in Glasgow 


TABLE « GLASGOW. GIRLS. GROUP 4. 


Weights for Height at 
Actual Age. 


each Age. 


6 2 8 9 10 11 12 13 1h 
33 | 30:0 — — — -- _ — — — 

8) 81:3" | 3ic3 = — — — — — _ 

35 32°7 32-7 3251 = — — — — — 

36 3420) |) adel 33°6 = — — = — — 

Bre W354) 85:5 35°2 34:2 — ~ = == = 

38 | 36°7 SycOresGen 35°9 — — — = — 

89 "| 38:1 3874: Wl 38:2 37°6 36°6 = = — — 

40 | 39°4 | 39:8 39°8 39°3 38°5 = st ae Es 

Ad 40°38 | 41-2 41°3 41°0 | 40:3 39°3 — — — 

By) “42-1 42°7 429 42°7 42°2 | 41:4 = _ — 

Galle Oreo) ececil 44:4 | 44:4 | 44-1 43°5 42-6 = — 

yh 44°38 | 45:5 46-0 46°] 46°0 | 45°6 45-0 = — 

3 | go | 46:2 47-0 W475 47°8 | 47-9 478 | 47-4 46°7 — 
| 46 47°5 48°4 49°1 49°5 49°8 499 49°8 49°4 — 
OU 7 = 49°8 50°6 51-2 51:7 52:0 52°1 52-1 = 
x 48 = 51:2 52:2 52:9 53°6 54:1 54°5 54:8 55:0 
49 = = 53°7 546 | 55-5 56°3 56°9 575 58-0 

50 == — 55-2 56°3 57-4 | 58-4 59:3 | 60-2 61-1 

51 = — = 58-0 59°3 60°5 61°7 62°9 64:1 
52 = — = 59°7 61-2 | 62:6 | 64:1 65°6 | 67:2 
53 — — = 61:4 63°71 64:7 66°5 68°3 70°2 
54 — = = = 65:0 | 66-9 68:9 71-0 73°2 
55 = — = = 66°8 69-0 Tes 1331 76°3 
56 ss = = = 68°7 fad 1377 7674 79°3 
57 = a 73:2 76°1 79°71 82°4 
58 Ss = — a — 75°4 78°5 81°8 85-4 
59 = — — 80°83 | 84°5 | 88-4 
60 — = — = = = SBE IE 91°5 
61 = a = oe 2 a as 89:9 | 94:5 
62 = = on = = a = 92°6 | 97-6 
63 = = = = = = —_ = 100°6 

TABLE ¢ GLASGOW. GIRLS. GROUP B. 
Weights for Height at each Age. 
Actual Age. 

6 fg 8 9 10 il 12 13 Uy 

33 | 272 — = — = = = — = 

3h 28°7 — = -- — _ — — — 

35 30°1 31-0 = — — — — — = 

| 36 31°5 32°5 as — = — — = =: 

3 33:0 | 34:0 34°2 — — — — — — 

38 344 35°5 35°8 35-4 — — — — — 

39 35°8 37-0 S74 eer oree 36-2 = — — — 

40 1 372 38°5 39:0 | 38-9 38-2 = = — — 

| 41 38°7 40:0 | 406 40°7 40°1 38-9 = - 

| 42 40°1 41°5 42°2 APACHE A9: OMe All = =~ = 

UB A Ate 430 | 43:8 44:2 44:0 | 43:3 | 42:0 = — 

Ah 43-0 44:4 45-4 45-9 45°9 45-4 | 44:4 = — 
ous 44-4 45°9 ATO: |, e477 47-9 47-6 | 46°9 — -- 
rots) 146 45°8 47-4 48°6 | 49:4 | 49°38 | 49:8 | 49:4 | 48-5 — 
2 4 47°3 48°9 50°3 51-2 51°8 52-0. 4 \5.51°3 51:3 == 
mr 48 48°7 50°4 51°9 52°9 53°7 542 | 54:3 | 54:0 — 
49 = 51-9 53°5 54°7 55°7 boo 56-7, 56°8 56:6 

50 = = 55°1 56°5 57°6 58°5 59-2 59°6 59-7 

51 = a 56°7 58-2 59°5 60°7 61°6 62°3 | 62-9 
52 = = a 60:0 | 61:5 62°9 | 64:1 65°1 66-0 

58 — = = 61°7 63:4 | 65:0 | 66:5 | 67:9) "edu 

54 ote Ze, = a 65:4 | 967-2 69-0 70°6 72:2 
55 = = = = 673 | 69:4 | -71-4 73°4 754 

56 = os = -- 69°3 TUGeal es Foro 76°2 78°5 

57 = = a — = foul 76°3 78°9 | 81-6 

58 = ae 75°9 78°8 81°7 84-7 

59 a — = = = 78:1 81:2 | 84:5 87-9 

60 = = a 83:7 87-2 91-0 

61 = = = — = — 86°1 90°0 | 94:1 

62 a = = = ae = = 92°8 | 97-2 

63 — = = — — — = = 100-4 


lk 


HK. M. ELDERTON 303 


TABLE 7» GLASGOW. GIRLS. GROUP C. 
Weight for Height at each Age. 
Actual Age. 


Height. 


Height. 


7 8 9 10 ib 12 18 14 
30°0 — == oes = a — aad ad 
315 = — — — = = = = 
33°1 33°7 = = ra a er aa aps 
34:7 35°3 35°3 — = = = = — 
36°3 37°0 37°0 — — = = = = 
37°9 38°6 38°8 38°3 —_ = = = ee 
39°4 40°2 40°5 40°2 — —_— _ = = 
Al‘O | 41-9 | 42:2 | 4271 415 — = — — 
42°6 43°5 44-0 44:0 43°6 42°7 — — —_ 
44:2 | 45-1 45°7 | 45°9 | 45:6 | 45-0 = — — 
45°8 46°8 47 °4 47°8 47°7 47°3 46°6 = = 
47°3 48°4 49°2 49°6 49°8 49°7 49°2, — = 
48°9 50°1 50°9 51°5 51°9 52°0 51°8 —_ — 

Blew soa, at | |ebacO 9 big | ebdea | bas — 
53°3 54°4 55:3 56°0 56°6 57°0 57°3 —_ 

— 5671 57°2 58°1 58°9 59°6 60°2 —_— 

— 57°9 59°1 60:2 61:3 62°3 63°2 64:0 

— — 61°0 62°3 63°6 64°9 66°1 67°3 

= — 62°8 64°4 65°9 67°5 69°0 70°6 

— —- _- 66°4 68°2 70°1 72°0 74:0 

— — — 68°5 70°5 72°7 74:9 77°3 

— — — 70°6 72'9 75°3 77:9 80°6 

75:2 779 80°8 83°9 

—- 775 80°5 83°7 87°2 

- “= 79°8 83°1 86°7 90°6 
= — — — — 85°7 89°6 93°9 

— — = 92°6 97°2 

— — 95°5 100°5 

— — — -- — oe 98°4 103°8 

— — = - — — -- 107°2 

TABLE 6. GLASGOW. GIRLS. GROUP D. 
Weight for Height at each Age. 
Actual Age. 

7 8 9 10 11 a2 13 14 
29°4 — — — = —_— —_ — — 
31-1 — — — — — — —_— —_— 
32°7 34°0 —_— — = — —_— = —_ 
Sule Ml Sein — — -- — — — — 
36'1 37°4 37°8 = —_ — —_ — 
or. | a9 396. | — we = = = = 
39:4 || 40:9 | 415 ,| 41-2 = 
41:0 42°6 43°3 43°3 — — — — -- 
DOT at Seno A Abs leaaey = = m= — 
44°4 46°0 47°0 | 47°3 46°9 — — — —- 
46°0 47°8 48°9 | 49°3 49°2 48°4 — = — 
47-7 | 49% | 50°7 | 514 | 514 | 509 | 49°88 me = 
AG Sete pine |) “52°Gr tlk (bo4e sles 7 Bale leony — — 

52-9 | 54:4 | 554 | 559 | 560 | 55:6 = = 
54°7 56°3 | 57°4 58°2 58°5 58°4 57°9 — 

— 581 | 59°5 60°4 611 61°3 61°2 60°7 | 

— - 615 | 627 | 63:6 | 64:2 | 64:4 | 64:4 | 

= = 63-5 | 65-0 | 66:1 Gil. 6 W681 

— — 65°5 67°2 68°7 69°9 71:0 71°8 

— —_— —_ 69°5 We2 72°8 74:3 75°5 

ly a= = Seer iss ier is. |) 79%3 
76°3 78°6 80°8 83°0 

— 78°8 81°4 84:1 86-7 

| 8l°4 84°3 87°3 90°4 | 
= i= = — 87°2 90°6 94°1 

— — —_ = = 90°1 93°9 97°8 

= = = = — — 97-1 101°6 

ee = = = _ — | 100-4 | 105-3 
jo = = — — —- | =— 109°0 
= = = —_ — — — 112°7 


304 Height and Weight of School Children in Glasgow 


TABLE 1. Glasgow. Height and Weight of Boys of Group A. 


Weight of Boys of 5:5—6°5 years 

eve ! 

OC en ror mey Ow ho kh abel | f [Totals 
& 29 ~~ Dd my § Ww = on) ban! Cale is) fa “ey el Sai Mel Re ios 
R®eaens HS HH HH SF SHSsBsoy 6 HH Ss 
1 1 — Se ST 2 
— 1 — 1 
—1-— 3 2 1 SS 5 = U 
—— 455 1—- — — —- 15 
—— 2 25 8 3 I] —_— 21 
— 1— 21714 10 6 1 1 - - 52 
——— 1725 16 15 5 1. 1——~— 11—-—— — 7 
——— 11020 36 34 15 16 2 — 134 
———— 415 32 48 35 20 4 2 - — — — 155 
———— 1 565 Ill 54 61 52 27 2 1— 1 —- 1 216 
——— 1— 1 6 22 56 52 3116 7 1 1— —— — — 194 
———— 11 2 10 21 54 42 26 7 1 2— — — — — 167 
- = 5 7 22 2229515 7 31 1—— — 108 

1 4 4 11 714 7 2—— —— — 50 

1 1 2 4 810 5 24 —— — — 37 

—- — -— T—- 142 —— — — 8 
= — —_- —— = 1 — 1 

— if 1 — 1 3 

1 3 7 15 52 91 117 192 206 224 145 87 54 22 16 8 2 1 — 1] 1244 


TABLE 2. Glasgow. Height and Weight of Boys of Group A. 


Weight of Boys of 6°5—7°‘5 years 


poheds side 


J 
1 
> 


OP a 


a 
Wi A ~T~119 
e —_— 
Pte 
bo bo © ~71 Oo 


153 220 225 152 


Totals 


E. M. ELprrton 305 
TABLE 3. Glasgow. Height and Weight of Boys of Group A. 
Weight of Boys of 7°5—8°5 years 
Beit yy yt | | bol 2 eae eae ee 
WOH HHH HH HW H G BH BW BW 6 15 GH GH YH H 8 BH HB 19 
S Sas es os es + SRE Sees eEsassssseRr 
P53 |e — — —— - = 1 
aie 2 — -———— — — 3 
oh | || a es = = 2 
OF 0 || area ces ee SS 4 
eee 1 10 
liars ——— 265 43 2 2 = = — 18 
Cee 12 6.13 3 2 : a == ==) “95 
Cae 3 1010 8 4 — = = 2 32 
gO, {——— 111119 1 7 6 38 = 64 
Mipeete = 2 4° 5 18 23 92 11 7 2 5 pe et 95 
eee 9 4 14 36 53 24 87 «(4 — a fe Se eee aie! 
eee A 39 44 60 438 15-1 - : 222 
Lie — on 1 1 22 46 52 53 34 18 611 a 244 
S| _—————s Ti 69 3054-43 83°91 6 8 211 
LE ,, ee rd) 14. 81 33.234-19 19.8 1 1 164 
2? oo) oe eM ly Weise Sih 16 99 T8 7 T 124 
Bo) -— ———= 1 ee o ee WG IAN 4: O19 Seno be 74 
S| = Se ee leh SiMe 2 =e ee eee 31 
Oe = jt ee et ee Omi OS i eg eH 15 
| 7 oe oa a 
Se ee le 
Totals] 1 1 1 18 24 56 89 157 194 202 249 159 150 83 78 47 21 8 8 1 — 1 — 1 | 1535 
TABLE 4 Glasgow. Height and Weight of Boys of Group A. 
Weight of Boys of 8'5—9°5 years 
Height} | | | | | Peeeiee Oe l wlied Rl beth ht SL | op Petals 
DD WS eS ee Se De ee te 
Ras es 5 O22 BR SR BS BESSSSESRRKREELNS 
—~ | aaa zs = 1 
le SS i 1 _ = 3 
= 1 Sie == 1 2 = = = 5 
1 1 a= iz 1 = = 3 
= == I) aah a ee 1 = (14 
= = 1 SS eee — SS SS = is 
Pmt 469 9 6-4. 1. 8 = 1 = 39 
a 21710 8 Ae AA = 49 
eee 9-11.98 95 18° 15. 6 > 4 99 
= = a 28.19 83 42 80 18 6. 2 160 
ot 615 23 43 45 33 23 11 6 2 “ —— | 907 
B —— — 2 — 1 5 10 27 44 49 37 2510 3 2 = 216 
AT yy J— — — — — elon 5 23ne40Ne 360952) 20) ON 1 - 206 
48 4, | — 1 Sao 16m 2) A026 Reso] = 152 
49, J —— — — — — — — 69 20 1914 18 4 Or 100 
50 ,, = 1 Ceo miomONN wtcci = 1a he 
51 ,, — — eee OM geeteG al or aie ee se 30 
52 ,, — — ee Nee re re eee Te od a 12 
53 J—— — — — — — 1 TMs 92} Smeg es ee eek ee 7 
54 ” — —= 1— — 1 1 — 1 1 — 5 
55 ,, = — = 1 3 — 1 — —- 1 6 
56 ,, 2 
60 ,, 2 
Gils 1 | 
1e2s 6-3-9 18 67 93.118 162 176 145 165 98 56 51 5% | 


Totals 


306 Height and Weight of School Children in Glasgow 


TABLE 5. Glasgow. Height and Weight of Boys of Group A. 


Weight of Boys of 9°5—10°5 years 


ee ee eee de Pe Pt te fe ry 
REY, TELIA CLOSE aE LC ILC) wD i) > iY) WwW WwW ay eiay Wey ey eye ev Eley ale) 
SMR AA HQKR &@ HH © K ® HH HK QAAH 
He HH HnmeS Ss FS WD WD vey wD SOF RO GECOMICOU ECO] oe 9 e 


35 ,, 
36 ,, 
or is 
eae = iT = ‘ = 
89, he I 1 . 
Oe. nse ele aoe al Dek - a 
ie ee gle oo eee eel Be boda 
\42, [—-——2 3325 2 1 : : 
43 ,, ae ee eee Eee i Bye I sl ile = ge: 
i ee eS PA ee et 
Les, | = — — == 2 1024 oS rei ele omenon | 2 
Hekapeny || (ees ee L Be 142 oon Oo) 19 OG) 7 46 aT zs BS 
fier 3) 2 8 : ] 
10 > 1 318 6 I 
50 1 929. % 19).19' 27 381616 6 4 — (== 
51s fe al ea el aispalsy okie oy) 
b2) 5. = - 4 al 2 4° 7-9) 19" 9) 7 13. 03) 
53 ,, 8 44 51 3) eee 
= eA Ss i Tl 


Totals 


2 1 3 9 23 19 45 95 114 126 142 205 142 119 121 77 59 38 15 9 10 3 


TABLE 6. Glasgow. Height and Weight of Boys of Group A. 
Weight of Boys of 10°5—11°5 years 

Hoan 1 tl ba) Dea 

1 1D WH 1D 19 WH 19 1 1D WH IO 16 

mM MWA BH DWH KR OADM HOR 

© Dow Oe DMOMOeRe eRe eRe RH TH HH O&O 
348 1 
Cue 1 
38 ,, 
39 5, 1 3 
| 40 ,, Leal! 2 
|41, ._—1—— 22——— 1 = = 6 
| 42 5, an 3 = ee Se 4 
eS ECT a Wee, BOA Bey — al 14 
2A oe | a es iy Gr By 4 = = 24 
Vaio =e eS Ol yl oO mae ma 2 o- ae ae 55 
46 cee =. ie iy 7” 0 eSal4 er Geen ome ee? == oe = 72 
[gee PCS ae Oe 8 e181 3028 eae 103 
oe ——— By Din OHS be OAs hy ily a 8} a ss =) \ as 
en (en ee nn ee Me epi atn Reap) GC) se) 191 
| 50,5. [= —— — —  F 2. 3 7 116 193) 727 48 oS 181 
Pye apr |f eae Ces age, See at V— 1 5) eS, 18) (354 eC Oniieoneo 3 167 
52 ,, = 9 = 188 A RCO RICE OO een 117 
| 53 ,, = : CE Gh MES ees TE IG BRS) BY i 99 
Be | = 1 Xk 3 3 H Sig BA = i= jl 44 
55 4, ae SS ee es ee 25 
56 ,, feces CE ee re Ie ee eee ame 1 oe ae 10 
15 tena (pea OR eg ——— = = 1 Se | ee 4 
| 68, J— — — — — — — = == = — — 
P9ns, afte ee SS aah ea ee 2 
_ a Ee Ln ee 
Totals] 2 2 4 6 15 41 35 61 73 158 127 125 160 114 82 76 70 48 321817 1 3 5 2 1 | 1278 


EK. M. EvprErton 307 


TABLE 7. Glasgow. Height and Weight of Boys of Group A. 


Weight of Boys of 11:5—12°5 years 
Lees Cee ll bl Lat | pQotals 
HS Be A 1) Go Wey Rey Be SE eS ap pete tae ears aie) 
DBO RwHaeHHSA K SF H® BO DBRMOAHHFMFR OH ODA 
QO SFS Soo © SF GF GF GCKHKHHDHAAAAAS 
as ere ES sy = 
—— — — IL — = 
———— 242 1 — 1 — = = 
——1%13844 2 — 
—--— 1 212 6 5 — 1 2 == 
—— 1l— 4722 14 7 1— 1— 
———/— JF slr20 16 ll & 3 1 1 - 
——— 1— 717 27 40 16 14 2 1 l= —_— 
— —— —— 110 33 41 33 22 12 3— 1 - 
—- —- — — — 1138 14 30 41 36 20 7 6 1 - 
Oe. == Il = Go 1G Sl 38 si lb 7 1 1 2 
iS, 1 5 8 16 28 312718 4 2 2 
a — — — pe ge 8 1 93 99coT 4 9 
6 ——————————_ udder Some 1O.uS ie Oa b i= oh Ip ee ee Oe 
56 = 1h, lee Une Gat | 
Si | te 1 ie ee ey oti Oe eee | 
58 ., Tine et er 
59 ,, 1 1 | 
60", = | 
Gi, | 
62 ,, 
6355, 
O4 ” 
Totals 16 50 98 160 163 143 91 78 43 26 12 
TABLE 8. Glasgow. Height and Weight of Boys of Group A. 
Weight of Boys of 12°5—13°5 years | 
| 
Height} | | eee ee il oe Tt el ts otals i 
ye ON Ia 29) SIO! AG a Ag; AG, | 1D 9 ©) ©) W WwW HW WwW | 
2 ££ 4 8&8 ® ® FH © DB ® BH BH B® HF B 
SS ey Oe PR Se BO" OG SG SS 
rn ee ea | 
” 
4A ” aa 1 5 cas 6 
LEB, og ft — 1 3 1 —_- — 6 
4G ,, — 2 4 5 1 12 
oe = 2 8 13 68 = = = 27 
48 ,, 2 ay Sy dkey yi 1 1 —_- — 41 
HOS ee 2s 18s 7 Ti <6" Al 66 
Oem ce = 7 97 29 «17 «10, 8 : 93 
BIS 1 — 1 8 26 34 40 22 6 1 1 — 140 
02 5, — — — 38 21 46 51 33 22 6 4 2 188 
oe oe 7 84 AB Be TI 8 i 1 160 | 
pe Ad, 20) 45. 87-18. 2 1 140 
ene) 8 8, 10 138° 88516. 10 FO. 8 
56 ,, ae uF eo Mtoe eG I) Qo 89 
es 1 ee 10. 10" <8) 4 8 = 49 
5B || aS ee ee Oe eee eas eee ee a a 33 
KD a a a oo Ge: 22 
60 ,, a Wor 23 4 
61 .,. a a oe eh ee at 7 
62 y, = ee ees We ae 2 
3 ” 
64 ,, —_— — — —- 1— — 1 
Totals 4 8 21 67 189 172 205 222 155 71 59 37 16 17 7 1 «1 1202 


308 


Height and Weight of School Children in Glasgow 


TABLE 9. Glasgow. Height and Weight of Boys of Group A. 


Weight of Boys of 13°5—14°5 years 


wD 
S 


ne Se ye Bp eS ee J S 1 ad u 
aS Wey) D> 99 ™~ wD Sy 99 — ia wD d Sa) = > 
Sop sys tis Sy ee! re) tase tase | IS) or ES SSS aS 2 
42: : - 
43 [ane ae ee 
44 1 1 = 
| 45 —- — — 1] — - = = 
| 46 — # 1 -- - — = 
ee = a a = 2 Se 
Sins — = ta al — — 
49 ,, 1°05 ahD 0 dered a eo = 
50 ,, cme eee eyercc oO = = = 
il x, — — 1 6 5 10 8 8 == = 
ID — — — 411 22 4 1 #1 - 
teh — == 1 42 AGS oe 4 eis ee ae = 
bh, Wee ee Se ee eel SS a 
eee (eee es Mee rl le 2S | 
56 ,, 2: M4 10) 10" 1a Ge Lj = Sea 
alte, 1 B62) Oe h8u 5) 4s = I — 
| Res os = — 565 6 4 2 2 4 1 — —~ ~— — 
59 ,, = = 1M Sie 1s 2 eee 
| 60 ,, —- —-— 2— 212 1—- => — 
61 ,, = Phat 1 = 
62 ,, =e = DS 
(Ge —- -—- — > HF ~— ~ —S — 1TH ~~ ~ ~ ~ ~ — 1 
| O4 ” = eT = a 
Totals BD 98 Il 28 46 77 56 GI 45° 365 T1715) 10" 825) lt 
TABLE 10. Glasgow. Height and Weight of Boys of Group B. 
Weight of Boys of 5°5—6'5 years 
Height] Pt ot LS 
1 A A 1G, 8 1S tS So Hl) tS od a) as ome 
298 1 = = 1 
30 ,, — —_— - — — 
31 ” =, = Tay ae, | aaa aa sats 
32 ,, | — —- -—- -— ~~ — — 3 
33 4, 1 — 1 2 — —- — — 4 
Shs, eS eageiyen p> sO ee = 5 
39 4, — 8 4 4—- ~ ~ ~ ~ ~ ~ ~— ~ ~ ~ ~ — 11 
36 ,, — — 6 14 3 1 — 1 25 
OT ss — 1 3 1441 6 1 3 — — 43 
38 ,, = 2D ISS OR ol 2 —_- —- — 67 
39 1 — 1 8 29 31 23 11 4 2 — ~ ~ ~—~ —- ~ — 110 
40 ,, — — 1 4 23 39 61 49 9 4 1 — 191 
AL; — — — 6 9 2 46 59 28 14 2 2 — —~ — — — 192 
2 — — — — 2 14 81 71 44 22 22 206 
1 ae 6 


1 


69 97 138 188 239 140 97 88 


TABLE 


thik 


E. M. ELpErRtTon 309 


Glasgow. Height and Weight of Boys of Group B. 


Weight of Boys of 6°5—7‘5 years 


Height Totals 
298 ae 
30 1 
31 ” 
32 ” 
33, 5 
ey 2 
35 Sal 6 
36, 3% 13 
Gress bang 23 
38, 6 8 38 | 
39». aay) 47 | 
HO Bone 2: 103 
Gis Sy ily 44 96 16 6 — 1 1 146 | 
42 ,, i Tl Hes 59, Ge CO. Me a Stes, aah 8S 
43 ,, a 2 47 63 59 36 16 3) 1 1—- —- — — 246 
AL ,, 1 Oy, AAD 00 27 5 5 2 1 — — — PANG 
ie Demon eocee ssa 7 Gee = TAO 
UO - a Y; lO eZ AO OD: ele 12 —-—- — 114 
ie - 1 4 by AIA iN) 1B} (3) 4 — = 60 
1S... 2 : ee ede Sree Ok pear, ee = 27 
oo i ER a a ee 9 
50 ,, Lf = 
Totals 44 93 115 189 218 215 174 119 
TABLE 12. Glasgow. Height and Weight of Boys of Group B. 
Weight of Boys of 7°5—8°5 years 
Height 


a) 


ad 
> 


2 3 
5 5 l 
2 3 ire 
41, J~——-—-—— Av “ey. a ak 
Vp ee 897 18) 2b) 68: (3- —= =—= == — e 
ee = DS 972 198 43° 96 9 98 1 5 
444. | —— — — — DW NG Bt cl (les Bs) ily ey 
45 ,, a fy iN) Bi tsfsh hy? YO) ills} © 9) = | 
46 ,, - 1 De loue4e3Si bse 268 2909) oe | 
47 ,, i 4 &A 18° 22 Bh BY Gb Tie & aS = 
48 ,, i) SBR IO) 1K ay ORY ye) 
49 = 5 oy 6G Lt, GBS en | 
50 ,, 1 1 1 eA gO a) eo | 
Bil |p = _ 2 2 iy sy 
~ = = 1 


Totals} 1 1 4 3 10 


30 51 118 126 185 222 


174 158 107 108 76 


~ 


26 21 14 4 


Biometrika x 


310 Height and Weight of School Children in Glasgow 


TABLE 138. Glasgow. Height and Weight of Boys of Group B. 


Weight of Boys of 85—9°5 years 


ee 
B4e 
35 ,, 
36 ,, 
87, 2 
38 
39 ,, 1 
40 ” 4 
ies 5 
4», 5 
3 ” 4 
ee 6 
Ds, 3 
Oz . 
5 


Ss 
Se) 


> TIA Co HW @ 


=~ 


MAMA AAaawewHePS> 
2 N CoN 


lee 


Totals] 1 3 3 11 32 85 100 156 160 162 161 198 124 89 73 45 17 14 7 4 2-1 1 | 1449 


TABLE 14 Glasgow. Height and Weight of Boys of Group B. 


Weight of Boys of 9°5—10°5 years 


~ 


L 


| | Pe 
wD I ID IQ 19 
4M HR OH 
© 6 SG DO GR 


37 1 

39 1 

HO |e le es = ——- 
Tie ee | ere a eS = 
Wor ss ——-— 2 22 3 511 1 1 -—— 

48, [~———-——-—— by Ge 8} Gy = 

|| = SSO EB Bl — — 
45 ., ——=— — — 1] 4171213 10 9 2 2 1 — - - 

40.5, ———=— 1] 2 2 6129°94 19 26> 21 8 fF 1 1 -—- — _ — 

Li ty 2 91620 34 39 20 13 1 2 Q — - -- — 
48 ,, = 1 31118 381 44 48 95 28 4 2 —- 
49 ,, - 3 1 5 16 48 45 40 3414 6 3 1 - 

BI) - 2 UW PA ily PE apy sy CHL ey alah 7 —_ = 
ihe 1 a 4 49 8) 1S 24 26 enero mele 1 

52). = 1 Le a 4 2 ISOS ae Om Cee Ome. 4 

ie == 2. 2 — 8 2 49°97 58 Yoo 4 OF 1) =a 
54 5, 1 1 1 1T— 22 8 2 J — — — — — — — — 
55.) _ 2 1 2 1 —- 
ob = = 3— 1 -- 

Dil sy - - = - 1 —_ 

8 ,, = 1 = 1 = 


Totals] 2 — 1- 6 13 19 26 63°80 94 133 188.159 133 132°82°50 51 S18 124 5 2) le 


EK. M. Eberron 311 


TABLE 15. Glasgow. Height and Weight of Boys of Group B. 


Weight of Boys of 10°5—11°5 years 


| 
Blea ce. 
oO SP) 
»~D q Ne) 


= | 


e 


Bre KH NODUD We | 


| Ww bo bo 
Or OU SUNT CO 


vo bo | Ww bo 


1 6 8 24 31 55 71 124 134 128 171 128 105 88 88 57 31 32 10 % 


Glasgow. Height and Weight of Boys of Group B. 


Weight of Boys of 11°6—-12°5 years 
Sere ete ile eel) je | petals 
Lg te Fy: On Or tS iW oD OH WwW KD ike they, Site, Ks 

1 

1 — = 3 
il 1 - = 3 
1 1 = = 3 
8 1 1 1 13 
7 3 2 2 20 
So" 9 14 6 2 1 1 - 43 
OM 4a 5) S08> alli 4 — — - — 74 
tp) lS) Ghoys YAS Nils 10), 2) 128 
2 13 25.35 29 17 #9 4 1 1 137 
2 4 25 36 438 34 24 0 2 1 1 1 180 
— 4 § 20 41 55 36 12 12 2 1 188 | 
1 2 5 19 34 61 37 19 5 1 184 | 
— —: il 2 4 16 22 30 17 7 38 2 — — Ll — 107 | 
1 5 38 9 12 26 138 9 2 80 | 

4 (6 lb B36 6. 18) 8 56 
i 28) of 360 40 OB 35 | 

= i—a—s 4A 4 = 14 

—- —- — — — — 1— 1 —~ — 38 1 i, al 1 1 10 
pa a ae 1 1 
— = 1 1 

— 1 1 
eS | ES 
13 48 70 1380 155 172 175 169 118 102 49 26 22 138 18 2 4 1 1282 


312 Height and Weight of School Children in Glasgow 


TABLE 17. Glasgow. Height and Weight of Boys of Group B. 
Weight of Boys of 12°5—135 years 


Height 

5 ih il 
iS ta 
love) S 


wn 


tnt Os 


Ss 
WAND OW WOND ABS Ss WHO O 


bo 


a 


DOS SRE EY 
— 


c 
S 


De HR ow 


bo 


1 
1 
3 
8 
of 
9 
2 
5 
3 
3 
1 


feet 


a 
mw oognok dr 


= 


Totals —— 1 6 17 65 113 150 219 223 171 114 78 4613 8 9 3 2 1 #1 


TABLE 18. Glasgow. Height and Weight of Boys of Group B. 


Weight of Boys of 13°5—14°5 years 


Height Totals 


u 
> 
= 


428 2 
43 5, ] 
44 ” = 
45 ” az ae: ae ia 
46 ” a 1 = a = 1 
47 ,, : 2 I 3 
ie. a = 5 
i ae = ee See 10 
50 ,, Sa ee Sa, SP Brae? 18 
Due, SS Se al 410 7 5 2— ~ ~ ~ ~ ~ ~ ~ ~— — 29 
52 See ee a 28 
53 = 61s <4 Ons 1 — 35 
54, fo= = SS 4, eS 8) To ye ee een 
56, Pi Se 6 AS 6 a 
56 ,, 3°23 SG Sede = AT 
57 ,, lo 6 ©9212 26) ae ee) 
58 ,, ~ 4 2 4290550 2) -— — 27 
59 ,, = ae 9 3 2 “1 3.93) -— 1. See 
60 = 1 l= = 2 
Gil = 1—- 2 — — 2 — — 5 
PB d |eee eo = 1 1 ae 
68 ,, ee er a 1 
O4 ” : = ro 1 1 
Gos Me 1 1 
SSS SSS SSS 


Totals 1 — 1 8 23 40 54 54 67 44 39 30 155 7 9 1 383 — 1 397 


eno At RAS 


EK. M. Euprrton 313 


TABLE 19. Glasgow. Height and Weight of Boys of Group C. 


Weight of Boys of 5°5—6:5 years 


Eewamieieeriiert ae let f= ee ay fy | | | Totals 
GMa oe PG BIG a se ye FAG Ay | uD 
Peal Ay NS SSE ery ae Sel esis fey ss QV S) 
8) ial Gabo Gayo Se eas sre ub i ears) ZWD 


TABLE 20. Glasgow. Height and Weight of Boys 


Weight of Boys of 6°5—7‘5 years 


| 
bo 
He 
— 
or 
ee 
“IO bo 
We 
| 
| 
| 


Ww 
i 
w 
Or d> CO GC or 
lad 


| | | | pow mi ator 


Totals 


314 Height and Weight of School Children in Glasgow 
TABLE 21. Glasgow. Height and Weight of Boys of Group C. 
Weight of Boys of 7°5—8°5 years 
}Height ty fo St eh ail, 7h el eels gat ae ee dh | jTotals 
I HH DH HH 19 1H 1H 16 H HH 1H WH 1 19 Ib | 1S 
Ts S9) AO) eS So Spiel imiteemena si a) sy tos Ne GSP Sa) ey GSS Sa) 
9 9 H HH © SS YS SH S59 ©) DH ) H) © OS © = 
308 1 
ohh Ss 
36 ,, 1 
37, 1 1 
CT an Pee ah ke CMO did ihe oi) 6 
29 Le To A ee 11 
Gor A ao oe 10 
pt eer Ge owe 32 
42°, Se Sa 43 
5S a We =, Soe peed ally 66 
eg (mes sear 5 TT 94 
| 455, Pe 1 010. 22) 7a 14) 3. an 104 
6 Se ES ape PRS 90 
47 ” Tl ae Sat Ge Se PSS SC ee ee eee: a ot eee ag ena 2 
4S ” ae q 
49 ” 
| AO op 
| ol ss 
| 52 ,, 
NODE Gs 
54 ” 
95.5; 
56 ,, 
57 4, 
Ooi, 
Totals} 1 1 5 8 23 37 66 75 94 88 61 72 52 2617 7 2 4 1 1 — 1 +4 642 
pei el [see = = = = = neon 
TABLE 22. Glasgow. Height and Weight of Boys of Group C. 
Weight of Boys of 8°5—9°5 years 
Height Totals 


an 


wer 


RHE HEH Oo Ss % OS SS 
ZA NMW Ss WHS OAWAND 


— 


47 


| 


PTTL LET] al ee] erro more! =! | 


wel | 


i 
WD re Ob 


1 
1 


5 
4 
3 
6 
5 
2 
1 
1 
1 


| Se 
NORE wWoOORR 


Bere 


E. M. ELpErRtTon 315 


TABLE 23. Glasgow. Height and Weight of Boys of Group C. 


Weight of Boys of 9°5—10°5 years 
Height | | | | | | | | | | | Totals 

enh ty ess eye athe key Ney Uy Ne ea io) 

m 8 C f= SO ) ~ 

Oo © Oo © 2 ~ 
392 -— 1 
AD op || Le re are 2 
Aly J — — — ee er ee = 
Oe. : 33 
43 | | Sapa aaa, 1 1 4 
4A 4, 5 3 3 17 
4S OS = ae eae 1 5 4 6 4 4 2 ——s —— comes —) 26 
cme el Oo. 3 9) OMA 7" 7. 5 = = SSeheil 158 
in ee ee 8 7 ay 10 4 Tod == 1 a 65 
48, | — — — — —- — — 665) 2011926 18e1t 2° —= 1 2 = = 110 
49 ,, — LB Gu Bis 1 ee wl wk By | Se Se 
50, f— — — — — — — — — 5 Gi lOnls,o3 10. 8-6) 3 ==] 88 
aL 4, = i i BO) By 13} MO) WO) ak 62 
b2 ,, — 1 @ © yf BR BP ak a ee et 
53 ,, ———— Oe Oe eStats 20) 
54 J —— — — — — — — = eee aaa Be ss 11 
55 ” nay) = as i rl = SS SS 1 1 — « —— 1 = 3 
} == i l 2 
2 i a ee i 
6 19 32 33 66 69 91 68 81 37 616 


TABLE 24. Glasgow. Height and Weight of Boys of Group C. 


Weight of Boys of 10°5—11°5 years 


Height Totals 
Ss al | it ! | | I i u i d Ney Al id a wD il ry if it rl il L | J a 

3 SHA SHS DR AHH HK DAH 4 S 

=> S OB, i 68S 6 6 MM BR RM NR Go oo H is) 


252 1 
oe) oe ik 
40 4, —_ 
41, J—— 1 —~ — — — — — — 1 
i 2 
we" 1) 

4h ,, 2 4 
45 ,, a) 9 
46 ,, Oy BY) 9 
AT ,, Be 9) 4.2 Selo Al = = 36 
48 ,, 6G 16slhel2 == 8) 2 == 2) = = 65 
49 ,, J — — — — — AWS AWeO iS eal 3a ae 61 
50 ,, 55 LO O9al'7 to 1k Bad) = = 96 
51 ,, LG) aly WO GT 2 80 
52 ,, = TL AIL Wee GY IS ee) = 75 
53 ,, — lS Le a OURAN Meron sees We 3 = S59 
54 4, | — = 1 Oy ese 8) Gel al BP Oe a a a | BG) 
55 ,, oe ul eo sot b) 08) Orr Fe 

56 ,, a et eee 0 yi Se 

Se SS SS 2 [=| 


39 44 25 18 


316 Height and Weight of School Children in Glasgow 


TABLE 25. Glasgow. Height and Weight of Boys of Group C. 


Weight of Boys of 11:5—12°5 years 
certs ee ie ee EN ek of" Howls 
Pe i eta Sie Dae Sagal Ue eto) bal ak Us) sy tS) | > pe SS 
Le OQ SH SSeS Ss 897 8S 1G) RE RD CO SS Oe SO Sa) | an 
Shr Ser Ui AOI Gi 6 8) SO! KO) UBS) BSE Ee Co Co ROOM EC) EOD EOS = oF 
35% —— ee Dee 1 
4l, Jl —— Se 1 
oes ] = == Sa OS OS SS SS i 
Jhy J- 1 — —-———— = 1 
ee ee ee ee 5 
46, |— 1 2 — = —_—— -— 3 
Vy derma Va ee a sy ye lee) lg Se — —] 13 
RS EE a SS a ee ee 26 
10) NS a onlay — —{ 41 
Oy —— 17 91812 611 2— 23—— —— 1] — — — — — 69 
ol ., —— 2 2 9 14 2823 7 7— 1—— —— — — — — — — 93 
52. ———— 315191819 461 1— = 86 
co 6 715 18.17 96 2) 2 — 72 
i Te aye ePBy als, (Ge Oho Ts ee 81 
DD, 1445 8 6 4 32 ———— —— — 37 
D0 ne —- 1— 4 3 56 810 3— 2 — | —— — — 37 
Sr APO Se eS. tO meal = 17 
AS = — 39 WS 7 ee 7 
59, — -— l — lili1ii1i— 1 1 8 
60 — ee oh eee zee 
] 
1 
Totals] 5 4 13 29 36 74 86 83 88 54 42 30 3011 7 3 1 3 — 1 — 1 [601 


TABLE 26. Glasgow. Height and Weight of Boys of Group C. 


Weight of Boys of 12°5—13'5 years 


66 
| 67 


ea a en ey EN é 
cee ee 80 
4 4—- — — 1] — — — = —-} 62 
15s¢ [2 oe ee 
16 9 9 = SSS SS eae 
6 199 493 Pie 
913 6), 222) 
By Sou ieeercbe le Sle) ae 
a 2 vO aS ois 62) al 
— — 1°41 2) 2 = = = 
— — — 1_m— 2@ ii — 
— —- — — —~ ~ — 1 2 ~~ —°1 
— — 1 
—— 1 
i 


| Totals 9 5: 90 32° 77 94 “881 80 154. 58 95 19978) 10) Gun ae en 


E. M. ELpERTON 517 
TABLE 27. Glasgow. Height and Weight of Boys of Group C: 
Weight of Boys of 13°5—14°5 years 
Height| | i ee aS ee eee ae | | | Totals 
EME rea et tay 8 AS i AS aR AG | 
9 DS 89 > ™ itn) er) Sp) aa = » D> Sa) fag al wD = Be) | 
ite} “WD Ney NS ~ ro ie“e) ive) Dp 5 je) S S i | ™ In| 2 
SSS Slee ot | 
i) = Se 1 
49 ,, J ae Ser i 1 
50 | a pe ame 4 
ee 2 TY 8 6 
SD os | ees ea ee Ha = 17 
53, 31° CRS as ine spay eh E 26 
55. ee oe G6 ae VG aA 2 lho 
55 ee Fh 816 4 ee 35 
Pe ee eT 6 10 18 a Se 1 Bt ee 41 
Die a TU 8lUCUGCBCOCC CB TD 33 
Se 08 at eed. Bk ye 
Pe ee a Ge 15 
60 ,, eee Vere s eo Roi SN Pare Se ON ag 
61 ,, = ea a ee eee ee 4 
ee le leo. eae 6 
Oo. a Ae ee es ee 3 
64 ” aie => 1 asi tees Caw * 1 
65 ,, ae ee ee ee 3 
(50 Ne ene a 1 Ee 1 
69 ,, —_—- — 1 lL | 
Totals Ae Open led) 49) S00 29 9? 1b. 8. 8, GB BD, 259 | 
TABLE 28. Glasgow. Height and Weight of Boys of Group D. 
Weight of Boys of 55—65 years 
Height a ie eee lim Serre a 
She ee yeh ee ey ee Slike 2 Sages £ : : 
365— ea 2 
a7, 1 2 9 
355, 48 15 
39 6 4 17 
40 5, 6 16 i 54 
ee 2 6 8 al 
Das 1o8 if 92 
43 ,, I 8 71 
44 ,, 1 44 
45 ,, 7 25 
46 ,, 1 12 
47 ,, 4 
48 ,, 1 
Totals 417 


_-Bicmetrika x 


~Z OD Ti Us ¢ 


a) 


OH OH GUE BE REE 
S 


We DWN OD 


318 


Height and Weight of Schooi Children in Glasgow 


TABLE 29. Glasgow. Height and Weight of Boys of Group D. 


| Weight of Boys of 65—7‘5 years 

ae mens Oe eee oe le ee eee Seo 

Yee et Moe rye Sie Oey Ye Mah 9s Mays ky 

Lee StSS Ne ope Gey Tey) Tee Fey Sy 9G sy) OS 

sh} 6 © He SBS SE SF SF YF HO 6 G 

1—-— 2 ~~ —~ ~—~ ~—~ Tre ~—- ~~ ~—- — 

— 1 = ——— oe 

— — 2 2 2 1 1 = 

— 2 OD) 2 eee ee 1—- —- — — — 
te: A) =e lpn aS aoa eee Maem ages 
om | ep 2 Sao al re el Oe) 1 = 
ey ee ee a MA wey OIG oh Si : 
44 4, SS 1D ly BR OB se 
Feats (emcee nate ay) ear eH OP MEL ACY §. Bh) Ge 
46 , 3) 8 Sallis Osean Oom a8) ae Tee 
awe 1 ieee Gy ise eee eee 
WRX = ee ees 
ve a ot ee ey eee Se 
505, — 1 — 
Totals 1 4 138 14 32 65 90 84 80 44 29 20 19 6 

TABLE 30. Glasgow. Height and Weight of Boys of Group D. 


Weight of Boys of 7°5—85 years 


| J Is J i al ’ | \ | i Y | | il ; l J 0 | u | I! f 
ey <3 5S U RS Fey iS > > S : Q 
x 9 §9 WD ~ 


~~ 


Ss 


a 1 a= fae 1 
1 = 2 _ 3 
ae gh a = = 1 

Se ee OS ae 3 
ee eet N Fil ai ee) al é eee 8 
ss 2s 159 ROM ON hae — SS | 58 
Occ 16G ble £6) ae opeel eG 

Seg eRe) icteats oor) OYh RB. Be = 71 
Hh) tee = Lae 8) Soe eee ed == || 8 
= 1 1 98) 20729 28\rs oa Se aoe 
: 9; 0 0F 16-220mo0 100 

1 2 tA er Ommnly 64 

= Sos see eee Oe a 38 
16 

4 

1 

2 

—.2 h&. 4 83. 42. 55. 94°72 80 69 68 26 14 9 5° ‘Ge ==ameieminnen 


EK. M. ELpErRtToNn 319 


TABLE 31. Glasgow. Height and Weight of Boys of Group D. 


Weight of Boys of 85—9°5 years 
ig Totals 
SEEN ak sta el ee eee | f 
Gy ae Sa) ey SS a els Wea ee bent Sy res tee he Spy ey te 39 
3 > S19 5 Bo SCS ES $ 
373 1 
38 ,, os 
39 ,, iia 
40 ” a 
4l ” 1 2 
42 , 1 2 1 6 
We & 4 a Ze 
Dh « —- I 235 4 2—- —_—— 
fo Po 2 WO 1Oe Ge 1 Ba04 | 
46, J——— 7131310 7.5 3 1 — | 
ieee = 15 24 17 1b 8 8 2 
eee = 421615 01-19 1619 7 1 
49, J—— — — — i T2210 12 th 8 3 I 
50 ,, | — — — — — Bees 2) el 20-0) 19) 13 — — 
IP 2) 2) 2 ADT 33. 5 3 
14 6 4 2 1 
12121 1 2—-—1—— 


8 16 36 54 614 61 71 5! 9 


52 41 25 15 


TABLE 32. Glasgow. Height and Weight of Boys of Group D. 


Weight of Boys of 9°5—10°5 years 


Height [inca ah fea eee ease eae | Totals 
: - ey ey > 9  \ yD 
3 2 

36 3 1 
oe 

3 ” 1 
43 5, 2 
44 55 8 
45 ” 2 8 
46 ,, 6 5 1 23 
Ty 8 8 Ale 3 48 
48 ,, 4 12 Bia eal 5D 
49 ,, 5 6% 20 15 94 
50 ,, 2 22 26 115 
61, | —— — — — 1 917 84 
52 ” AO OMA SeG 453) — 60 
Dy 3 40 
54 ” 1 12 
55 ,, 2 
56 ” 3 
Siew. 1 
58 ,, 

59 ” 

BO 

Cn. 1 


Totals} 4 8 10 24 27 34 68 50 6: 52 45 25 ¢ 558 


320 Height and Weight of School Children in Glasgow 


TABLE 33. Glasgow. Height and Weight of Boys of Group D. 


Weight of Boys of 10°5—11°5 years 


Te 
DQ 9 


J! 
iG 
i 1 
io 13 
49 ,, ‘ ray mies 
Ol, Bediky py aye ty ey YE ak bal 
Sil — /610920 14516) 9! 6553 
52, J—-——- - 1 3 41014 380 17 1211) 4 7 1 2 — 
53, | ———-—- — Ne es) oh kay ye se Gy OS} Te TI 
Bee — 5.5 6 8 410°563 253 =] 
55. 9 4°36 4 4 3 900 2 
He) <p ] 1122 3 1—— 1} —— ~— — 
57, 3 ah 
1 


bo 
| 
e 


1 1 11 9 13 26 49 53 64 65 75 49 48 31 3118121210 7 2 3 1 


TABLE 34. Glasgow. Height and Weight of Boys of Group D. 


Weight of Boys of 11°5—12°5 years 


Serine tal ye 
SESTESSS SRCRKRSITESRSESS F 
l =—— = 
= ] ss _ 
1 
1 2 —_— 
— 8 Se 
Ue Bie le Bl — 
aa 2) 2 Oe Cale eee —-—— = 
= e215 e ON Ge 7 a 
—— 3791212 6 6 38,1 = — — 
= sl 1 39 8 18020 ot 6. soe 
—-— 12 7 527 3714 8 2 1 — 
eS) Sle PEyP Py ail ce ET 
1 Let 3) 15) 1 s10%) S; aeela sie 
-- - 1 5 99 71051023 I 
erie. Se 2 
1 22 aoe ee 
be Bo 2 l 
2 l i === 
: : ae 
1 — — = 
Totals] 1 — 1 10 13 39 44 54 83 112 77 50 3292113 6 7 — 3 2 2— 1 


es 


K. M. 


ELDERTON 


TABLE 35. Glasgow. Height and Weight of Boys of Group D. 
Weight of Boys of 12°5—13°5 years 
Height Totals 
443— 1 
45 ” 
40 ” a 
4i ” 2 
48 ,, 3 
49 5, a 7 
50 ,, 6 16 
51 ,, 6 37 
52 7 48 
53 ,, 6 1 72 
Cah 1 = 84 
55 4, 1 4) 1 101 
56, nie 2 =e 80 
On ss 7 ae 68 
58 ,, 4 See 42 
59 ,, 4 3s 21 
60 ,, 1 Tesi 1 14 
Bi. 1 De ee 9 
Go. — elt, i 
cee ieee age 4 
O4 5, 1 
Totals 32) 31 59" 779 TO 119 65 62 22) 26 611 
TABLE 36. Glasgow. Height and Weight of Boys of Group D. 
Weight of Boys of 13°5—14°5 years 
Ese] ol Ce a a ee 
y VE) UD 2D 1 WD voy ey ite) Lays los oy 
= a m 2 i 
5 ee Ol 
51 | os ara ee a 
ae 5b 6 = — 
eee — 1-6 4-10 2 -3 
54 5, —— 1 45 815 4— — —- 
DS io ——— 2 3 41310 4 2 1 
56s || i ee toe) ee ee a pe 
Di § ———— 1412 97TFT— 2— Q 
XS || —— 2 16 OMI SL2s 245 Go we al 1 
59. | — — — — — — Op By= 8). 1K0) 2 aie 
60 ,, Teja oeeo eh Geet Tha . 
ot 24a 1 SS 3 ee) 
62 ,, peers Oh sult oy ae ee eT 
63, le Oe re as 
O4 ” Win’ = iat Sane eg ae eT Oa ee eae ee ee 
65 ,, 1 lo il Se Se 
66 ,, — — = ——— = = = | 
10 5 | — — — - —— — ——— 1 
a 
Morals i we 1299-95 47 6415940 41 1396 14 7 6 5 2 1 2— 2 — 1 


322 fleight and Weight of School Children in Glasgow 


TABLE 37. Glasgow. Height and Weight of Girls of Group A. 


Weight of Girls of 5°5—6°5 years 

hel 

a ss 
29 4, =a) = ae 
30 ,, ee ee = = 
Cig. = — — 3 
oe — 2 eT 1 6 
he — — — — 38 1 i— il ili — = = 8 
Bh yy —-—— 2 8-4 5 1—-— 1— —~ — — — — — Ff ,16 
85... i ee 88 DOR aie Le aera 1 — 44 
96 — — — — 8 122 122 1 5 1 — — — 49 
ee Le 1 = 38> 9530 sd BOD ole 1 82 
38 ,, —— OTA eke) a Sib a8 8} — 124 
oo 4-13 43 59 23 19 6-=— (9° = j= 170 
40 ,, 1 7 24 42 55 46 14 6 2 — — — — 197 | 
Via hae 3 8 6-20) 45> 757.) 19) bn Sa oe 165 
Lge 1 1) 49902351) 929) S17, Ik oe eee 146 
4B ,, ieee gi Ahkeiloe uch Yi Oc — ie 
As, Px SB) Gp 07> SO pee = 38 
45 ,, = | 1 1 1a oe il 
Vow - 1 -= “2552 6 
hae = I Ae 3 


Totals 1 1 4 8 25 50 110 146 183 186 205 100 47 41 24 8 2-2 1143 | 


TABLE 38. Glasgow. Height and Weight of Girls of Group A. 


ae 


Weight of Girls of 65—7°5 years | 


Height ik ih al i ‘I it 1 | 1 | L | L | al a Hl al t il | al ab Boels 
yk aie nest Daas OE ; oe 5 ey 
t aS > > Ss wb WwW Ns) 

30 &— 1 
COL” ex. 1 
82... 4 
Some 3 4 
o4 ” 2 er 6 
85 ,, Do 18 
BO, abe ali 23 
BT. 5 ee KG) = 38 
BET 1153 1 69 
39 ,, 4 16 1 119 
40 25 3 = 164 
tee = 8 4 a 218 
HD — 2 2 2 1 — 215 
43 ,, l Oe ay = Il == 211 
44 ,, A kB 166 
wa) Ley sy" 7 91 
4G ,, SOs oo 34 
LP ee ya 8 ll 17 

1 iL & 6 


= 
1? 2) 
| 


Totals 3 BY tb 95 153 189 265 191 1438 143 : ‘ i g 1405 


E. M. ELpErton 323 


TABLE 39. Glasgow. Height and Weight of Girls of Group «A. 


Weight of Girls of 7°5—8°5 years 


j la al : 7 
ae ee ee eo eh ek te ae 
SY DB ms DS © & > ™ SA) 9D co SH Ba ay Rey Ge Sen tsa ey aoe UD 
RNR VR RN 5 46H & Say = > = = os got a Nene es hn Feng ken Se Ss eee eee co 


1 
1 
2 
4 
1 
A 
5 
10 
3 
6 
1 


AaNOoORe wy 


Totals] 2 — 1 7 25 39 71 108 183 199 189 197 182 80 71 46 29 5 5 1131—41 13938 


TABLE 40. Glasgow. Height and Weight of Girls of Group A. 


Weight of Girls of 85—9°5 years 
Peon st | | | td | lard eet | [Totals 
Eee oe eNO 2S Rotor IG) US is. Was MS SS | 
yar es ID sy BA mw ID a~ ™ 89 cys 2S) 9 9 ~ 
Q wm s 2 8 3 : 
328 1 
3S 99 nn 0 nie! apenas ce ee ON a GR ae a ee ea See il | 
OW yy il) 
a5 3i| 
36 ” 4 
Q = ] 5 
6 3 14 
5 1 14 
Lessa lees 2 — — — = 26 | 
36.112 4-56. — 46 | 
SoM PS eS: oy Gas” I - 72 
A tae) 07 DOR 20m Gre 3 121 
— — — — — AW 22e 73840 Aer ee D8 1 160 
== — — 2 712 33 51 46 33 10 10 3— 2 — — —— — —J] 209 
45 15, 42> 40°30 32.98.10 2 3 — — — — — 1 |, 219 | 
— 12 7 li 26 32 23 3012 6 6 1————— 163 | 
2. 2) VO Ge fo 2438.15 12 4° 3 1 —.—— — — || 133), 
— 7 3811 91697 2 1———— 65 
—— — Ih Sol Miler =—elleSslO S.Quil 2° Bo 4. 1) a a 37 
— — eS pS ee ee ee Ss 17 
— 141— 7 
—_—— — = SS ee ee 2 
a Mi ene 1 | 
90 135 214 170 129 112 122 68 40 28 16 10 1314 | 


TABLE 41. Glasgow. 


Height and Weight of School Children in Glasgow 


Height and Weight of Girls of Group A. 


Weight of Girls of 9°5—10°5 years 


n) 
8 
SS 


Diy 


Ce ¢ 
a) 


ec 
Ys 


RR & aw 


VS OS 


es) 


SS 


nS 
_ 


Nt~ 9 


a 


1 
4 
0 
6 
3 
6 
3 
1 
IL 


3 
womornw rs 


it~ Ce 0 


FNL OR KWH 
| me Ore Phe 


| | 


epity 


ax 


AAAAA AGH 


Cs 
eo) 


22 36 45 74 131 107 147 135 153 112 100 


74 48 30 2! 


TABLE 42. Glusgow. 


Height und Weight of Girls 


Weight of Girls of 10°5—11°5 years 


ae ee | 
Mote viet oe Woreierilap ape at Wel iene 1 ike) 
STEAD aah SGN OS) Galo Nee RS GS 
3 St SS + J Ye 3 


U 


eee 


ns 


x 
OS 


on 
bo 


Gg Cs Ce oe oe 


& s 
SS 
exes 


Wi Le CH 


DE Db orp w 


wo BATON TR ee 


CON 
roe | eo OUR 


= 
a) 


HN eS HSH 


S 


Mts Og ]WWN 


an 


e 
w 


et 


1 
8 
6 
18 
16 
9 
5 


PpoaT@o 
BSH CODDNWe 


Oo ND 


AA AAN 


S 
Xe) 


| Totals 


5134) 18022) 55) 3) 272 ol 


EK. M. EvprErton 


325 


TABLE 43. Glasgow. Height and Weight of Girls of Group A. 
Weight of Girls of 11°5—12°5 years 
Height Totals 
TL 1S Se GS tana ran es 
lim twos OS ace l 
35 ,, il 
38 ,, ] 
39 ,, =e 
40 ,, 2 
41 ” il 
42 5, 2 
43 ” a 
44 yy 13 
45 ,, 20 
4O 4, 32 
47 5, 65 
48 5, 83 
49 ,, ) 139 
50 ,, 4 162 
267-1 A l 161 
20 ieee ol 153 
— — — — — 112 7-1 1 — — 133 
3a Orb — I le 92 
4b Oh by cay kh 93) = eS SS 65 
All) 83, PE ie 1) DE Oh. ee 41 
TRA a a) see i eee ea ee 27 
Sot elle lle ———t le a 1 9 
as ES — i 
| 2 ee 1 | 
3 
a 1 
20 41 70 OD De Otsolelo elo Ome seem 63) doe 1s 7 1214 


129 159 


of Girls 


of Group A. 


TABLE 44 Glasgow. Height and Weight 
Weight of Girls of 12°5—13°5 years 
Height} | | | eae | | leeelee lie le al : Totals 
Se] WW WH H WW 6 BH WH lH H H we ww IO WD | 
Reese es DS 1 SS RS SS 
Se See Se ay Ss) SOP OP at ROMO” Os BCI ho a 


139 120 


= It] 

4 | _ — 

2 3 — 

8 4 1 — — 

Ob 43 2, = 
13.1310 1 1 = 
712 4 12 — = 

3) 6B) 1 il eS 
12 6—11—2 ——— — 
Ae Deedee Dt 2, 1 — 
es 
een ES EG Ye es ee ae ee 


Biometrika x 


326 Height and Weight of School Children in Glasgow 


TABLE 45. Glasgow. Height and Weight of Girls of Group A. 


Weight of Girls of 13°5—14°5 years 


a 3 
> ~ 
Ks) io) 


464 — — 2 —-—--—- - -— 2 
47 ” 1 a ah SSS 3 
48 ,, Boe ee 2 
vom GBM Se Sip ROMREee gs =o tamer 10 
50, ==, A OC Ogee Se pasty vel at = = 13 
bl | eee 8 68 ea be 2 es 
Vee) Weer oe i sO) Gy A _ = 33 
58, | — — — 2 2 3 2 6 4 4 3— 1~—~ —~ —~ — — — | 27 
hy | — — — — 1 5 1 10 8 56 5 1 ~ 1 ~ ~ ~ — — Ff 37 
55 ,, = 1-9 10. -% 80 8 ee en 
56 ,, — 2-167 8.8 4 6 6-—9 — — Wee 
Cygne (eee ee ee Po Se a eT OS 
a (i ee ee ee Peay oy ae OO |! Be 
BYP) 3 —- — — — — 1-— — — 1 4 3 «4 a — o al 2 22 
Ce | = ee eee ee SS 3 
61 ,, = = — Ss 1 Flt Sse a] = 5 
2 ” Fay = ch Ges Osa eee 2—- —- —- —- — — — 2 
| 20 30 22 28 11 291 
TABLE 46. Glasgow. Height and Weight of Girls of Group B. 
iF a | 
Weight of Girls of 5°5—6°5 years 
Height rama lave lem Totals 
: eo) Re RD 
isp) 9 2 jer) 
39 > => 
Diet em gal ae ah 2 ee 1 
30 ” 1 4 ae .- Lo ae 1 
Bie ——— a= ae = = = SSS 
32, 2 = 2 
By 2) 4 
Bh ” os 2 1 =a = a d 
Soe Ne oe ade Ome eee a 14 
86. 1 oO --6 10-9 = SS ee 28 
B75 Se BOB 6) I ee ee 49 
38, = OY 29. 86 24.10. 5 8 ee eee 78 
ore 1’ 93s 18°238" 888033). 14 924) ee rn 152 
40°, [oS TD 84 P81 536 80) 13) 29 se en eT 
le. — — — 6 18 48 89 45 22 6 3 1— — — — — 188 
42 ,, —_- — — 1 4 9 31 53 34 14 5 — — — —~— — — 151 
43 ,, — — 1-— 2 6 12 37 36 22 21 4— 1 ~—~ ~— — 142 
44 5, — — — — 1— 4 14 17 18 12 8 — — 1 — — 70 
45 5, — — = — 1 1, 4 8) 5 8 7 Se 24 
Wi eee Oe ees Te 8 
Vat pee = SSS Se Sat 3 
54 5, =a mM ee eS Oe SS SS SS 1 
55 ” Peat, Se ee eb See a gee a ea = 1 — 1 
Totals{ 4 17 37 93 148 171 198 201 133 61 55 18 3 2 8. 1 Teme 


KE. M. ELprERtToNn 327 


TABLE 47. Glasgow. Height und Weight of Girls of Group B. 


Weight of Girls of 6°5—7°5 years 
Height 
327— Seeger 
$3 ,, Dive VS 
34 5, a 4 2 1 
35 ., —= Sey 
BS os ee = 5 = 
ny ae as a= $2 2%5 A SF ah ee ae = 
gee eee me 7a OPC IO) 4 Ae 93) lt Se 
89 ,, > Do Sly bay al) ye 8 ae! 
Oe i 3) 20. 36 298 280° 8 8 1. 1 1 ae aa 
41 ,, = ToS A NO). Zu 8h) Dy i OS Se 
42 ,, ee elo oh 160) 47 2 lo) 9) 9 
4a, | — — — — 1 2 2 52 57 45 23 7 3B ~— 1 —— 
ane 3G) 80) 56 465 36 20 8 38 2 ———— 
45 ,, = 1 9 14 30 31 23 15 4 @2 = = 
Cn a 8 9) 8) 18. 13) 8) 5 = 
ies — — aS > Re EG OD il 
48 ,, —— SS SS SS 1 1 — 83 2 4 
499, | — — — — —~ —~ ~ ~ ~ 2 ~ ~ — 1 1 
50, J — —- — — —- — —- —- —- —- —- — — 12~2 1 

157 224 215 54 Ql 


TABLE 48. (Glasgow. Height and Weight of Girls of Group B. 


Weight of Girls of 75—8-5 years 

Z x PRS om, i a See a os 4 | 

Height | ot aan Same a ae (aa eile | | [Totals 

Us Be a ‘ DD DM kD w% | 

Dd 89 E Sa) a o> | 

a) = ID UG ie) 

| 
3 1 

34 ,, oa 
Oy ‘ 
36 ” 1 

aT ” 8 | 

RE ss 13 | 

OY 5 34 | 

40 ,, 45 | 

4l ,, 87 | 
42, J~—— 1 2 3 622 35 30 16 9 2 J—— — — — — — — 127 
meee G88: 89. BI -36..80) 16 oh) 8 9 = 177 
44 ,, t= 208 

4d 4, fee al 229 | 
AND sy. | a ral yes 166 
47 yy J — — — — — — 10 2 4 132 
48 ,, Ona 69 
ID | | SS SSS SS Al cd 33 33 
50 Sup 16 
Bil S279) 1 4 
52, J———— — — 1 

53, | ——— —— — — Lo 

Totals} 1 1 4 15 25 51 78 155 16: 159 115 96 30 19 12 1356 


8 Height and Weight of School Children in Glasgow 


TABLE 49. Glasgow. Height and Weight of Girls of Group B. 


Weight of Girls of 8°5—95 


years 


DD 
=> 


I 
fos) 
4 
Ke} 


Coe Orb bo 


| snaozaal | 
Hanooquere 


Totals} 2 1 1 38 15 42 74 92 123 163 161 142 156 132 98 58 44 23 8 8 5 4 1 2 2 | 1360 
TABLE 50. Glasgow. Height and Weight of Girls of Group B. 
| Weight of Girls of 9°5—10°5 years 
Height) | Pd lb al a ee ee 
1 19 ID 19 AS 2D WO AD Wg AHA ADAG WAG AS ts to Oe oe Romeo mS) 
a eS depremab Say ats) =~ on) ™ SS) ID NS ory asl SSsaleeewatss] les) Gash Ga) US) oO 
HH H H SS SS Suen HIN SHKCONE cE IRS Ns) To sy AS aS RS CS 
367 ] = 1 
a7 ae 1 : = 2 
eoe. il 1 1 - - == 3 
295 el 1 1 =_ 5 
40 ,, 2 1 1 ae 4 
ee Cee ee ee alee oh = 10 
IO Neale De eae ieee _ eon 
ie ae eM ye a. od 2 eee 
i ate rem besa ealeg alls Oy. 1B) ee ih he es : = 62 
DHSS ——— 1 2 317 30 23 22 to 2 wl — —_— 117 
4O ,, ———— 2 217 22 28 22 20 13 8 4Y— 2 — 142 
Wie Soyo} eis Sh) Zbl Bey Bey = 4 Hl —- —_— 194 
ke 5 | SS SS L169 le 23830 AC Sl Ae eee ee 190 
49 ,, 1. 8 811 1b 44 41.99 76.8 6 7 eee 185 
00%, ] ay ES) lise Xo} iO) AT 0) GO ee 136 
ons ] 2 1 1 By ey Ty aly IG) Wk il By HY sD — 1 110 
52 ,, 3 2°98 9 9 9 7 8. 3— Teneo 
53. Pipe lbes) Pe aNey in lee I] Lael 24 
ae 1 1 1281 eee 8 
| 55 ,, 1 |e ee ee 2 = peal 8 
item (eas oe ae ai ee ee lo 1 = 6 
Ce aoe 9 Sy ee 4 
3 4 11 24 35 76 103 127 136 126 176 138 116 81 


K. M. ELDERTON 329 
TABLE 51. Glasgow. Height and Weight of Girls of Group B. 


Weight of Girls of 10°5—11°5 years 


[Eh PT a a ce 


Totals 


ee oy, eg on see Se ORO, WS te “tt a ie Ass G. 
Soe Oats SO LOR re Oy ry) Sd) Ce. Od 
6 % SD) RCOle eS Gece, 2S IS CO! 109) 109) 65. 09 


7 
9 

41 5— 

43-5 — 

91-5— 
3 


10 


caw | vo 
w 
aI 


— 
He bo 


| 


| ox 


a 
WOanNTRARR 
See 

— OO 


eel 
RP Ol fp bb 


Sa ew oror-1 © © bo 


| FNWWNHNYe 
| ern ores =t | [oi Hel I 


| mies arorosen | 


eae COS) Ee) 
a= or CO 


1 1 


years 


Totals 


| 
fo) 
» 


I~T He GO) COT IO 


Tee 


S 


Oro 


| 
{ 
edie 
J OV 


Se POH Oke 
= 
= 


_ 
w 
w 


bore bb b 
i 
— 
i 


WMOWDQO of 


u 
or 


ES) 
Sw 


i 
b 


| | 


| = ro | ATR NI OD bo 
| =i enon bo be 


Totals] 3 4 14 29 54 103 140 162 171 129 131 86 74 59 361231212111 3 3 2 — 1 | 1252 


OW PENH EH 


330 Height and Weight of School Children in Glasgow 


TABLE 53. Glasgow. Height and Weight of Girls of Group B. 


Weight of Girls of 12°5—13°5 years 
o 
cia (Me ee eh ob has oe ak 
Rel NSPS GSE ast WOU ONEMICRYS Tue) Moh (SN IGA tS Gal) Rey ea Ses Se) NS 
“pb ost oF 1 iD Tope resets es) usc). sy Key SN S = = Tel gtel x X 
SSS == 
Dh A iae |e eee na |e ee ee a a 
43 ” = sy a oat a eal 
ga Nias ee Dad agar ees ae =e a 
Pitino et lye ane mes Se e es — 
YGe A reseed ee -—— — 
Gp oe Se eO! GAs Eo ameD ee = — 
YSt ieee baie weed eee eas — — 
$95, = =e OD: SISA SS Ope ee 
BOs 0 Wo = 2 ALO", 1 Gomes Ge ee = — 
ble, (oe LD S00 een = — 
525, = 2D) 198 82 SP GOSme inert tee = — 
Ce | eee 2: 5, (2333931 lo o6y ole = 
54 ,, ee eI eS ale RO ayay ey I) sy zl | = 
Ds ——— 12 5 14 30 48 941610 4 1 — 1 = 
56 2 1S) 3) 10) Mibg 92407 68S Ay en ee 
Ye | ee CREO Se ie iiy (9G Seo) 
58 ,, — 3 20 10N 7 1314 on ie 
BOs Wretntos a eae —= 5 6104s 
60 ,, eet Lge Pgh Se ep 
Gi 1 ey a ye she OY ire 
62 ,, = 1 1 SS) Sea 
Gia —— 1 —— 
60 ,, 1 — 
67 ,, = zal _ = 
OS ” os 1— = 
69 ,, = = Catal apa ahh Se 
Totals] 1 4 18 34 87 144 133 180 154 125 93 63 63 231612 7 3 3 2 — 1 
TABLE 54. Glasgow. Height and Weight of Girls of Group B. 
Weight of Girls of 13°5—14°5 years 
Sa kod Wk hs 4 hobs " Sg | A 
$38 6 8s 8 SRkkSseRseSESS x 
sf—| — — — 1 — — = a 
| 46 ” 1 ae rae LS Ss as aa ee oa aN 
4? ,, = = =) Se = a 
48 ,, a i1—- — — — i — = 
oe oS ge ee 
GO. i ete nl ln ee — ees 
sf, | = '— 2-9 6 9 2)6 7 ee 
50. Ns ee 8 0 GaGa mee — — — — — 
53 ,, Se ee sy Oe) I A Oh i ae Se SS 
54 ,, SD oe LT SC oe one ele eel 
DOI. = Te A a ida: SS 
56 ,. ee ee ee cieihien) Oe eS De 
Oe, —- 1— 1 TL a Ge Ge 2 lO ae Oe 2 ee eee | — 
58 ,, 1 ae er i BY ay GT eee 
59 ae 1 Th Fe Ae lla 
| GO ,, = it oe 4 il A SI 
61 ,, = 1 peeeen MO Si RB Aye Op ee ee — 
ie 1 eo 
680 = = SS ey a ee Ss 
a LD | 
| Totals 1 1 7 11 21 25 44 59 44 45 47 31 21 21 10 5 8 — 2 


E. M. Exvprerron . 331 


TABLE 55. Glasgow. Height and Weight of Girls of Group C. 


Weight of Girls of 5°5—6°5 years 


Height | | | | | | J Totals 
. ae 
= <s 
i 1 
32 ” oa 
3S 4, 
Bh 5, 
§ a 
22 
43 
56 
18 19 13 3 69 
9 


PEL TT LL eno om te | 

a 

are D : 
[oe] 
Sil 
He 
| 
| 
| 
| 
| 


| Hee 100 ox | 
| ec 


TABLE 56. Glasgow. Height and Weight of Girls of Group C. 


Weight of Girls of 6°5—7‘5 years | 
Bega te l-t tI leery ee lade Ste) dee he he ap Botals 
Sh) Toy A Mae ipl aise Belo e alop tte 

. . . . . . . . . . ~ a . . . . | 

i 
2 
3 
7 
19 
34 

ae 50 

4l ” — 105 | 
42 4, Boao, Oil fee Ion 
ipsa Lop Shoe es 03 
4h y, 77 7 41 — — 79 
Hie (SetGu 1 232 oY 61 
ye heyy CG oee Gan) ein it 
ia Messe te or Or et 14 
38 ee ea A ee 9 
49 ” Py a a a See = 
50 ,, —- — — —~ — 1 — 1 

| 

Totals 617 | 


332 


Height and Weight of School Children in Glasgow 


TABLE 57. Glasgow. Height and Weight of Girls of Group C. 


Weight of Girls of 7°5—8°5 years 


’ | 


5 9 4 to) AS 


a gyi FAs =e Ney J i! | 
& LD ~N fon : ™ Say 
= Sie ast © <6 


V7 5 — 


ec Yree aes iineee Lol Se 6 2 Se ee 3 
Peer berenaree z = SSeS St 
Gia Ye Se Ee Pe a ES a 4 
|. 39 == wile ob gael eae 2 eee — 12 
$0. 9 — BR = 7 98 4 3.99 9 2 ee een 
Paces i. 1 8 eR = Bpemomme enon no 2 39 
| 42 ,, 4 AG SS AS Se ees ee SEP 66 
one — "= 1-8 8319931 1619 =3 3240 ee en 
L444, gf) ete = 19) 96? 30° 98-1 7 a) ee ee 
| Yo. Gp ee ee 2 4 1-1) 87 S18 ig) 8 a ee 
Ihe) il Sy ile AV eb 118} 5 2 2 — — = — 
lene 2 1 7858 I yoy 42 
See. = 4st 405 6 S10) 23-8 3 oe 
or 1 Li 8) 382 
50 eal ergs SO Me gl ee 
51, = 1 
Doak. — = 
Doms — — 
o4 ” are es 
Totals 65 87 129 66 
TABLE 58. Glasgow. Height and Weight of Girls of Group C. 
Weight of Girls of 85—9°5 years 
ff = 8 e : L i) 1 i d i if J J uD 1 [ al ul i J a el is) wD J J Xe) J if To t als 
32E 1 
pis = 2 
some ye 7 
THO) i Ba il 6 
Tie Sa ogee 9 
2. 2 2483 ele a l4 
sen 8 6.94 62. 2 4 22 37 
Te 4 6 916 9 56 
iar i 2 9 11 20 20 95 
pte Rei eee I “6 6 21 31 2 111 
ie Sy ei) oe Ge i eS 102 
Ban BES te nG 89 
yo 2 2 59 
| 50 ,, a 27 
51 ,, a 16 
D2. 4 
58, F—— ee er eee ee I ee ET ET 2 
i — 
De op | he es ee eS 2 
| Totals 11 23 38 46 78 97 iO) TEE a OE 639 


EK. M. ELpERTON 333 


TABLE 59. Glasgow. Height and Weight of Girls of Group C. 


Weight of Girls of 9°5—10°5 years 


1 


Height feel | 


1 
| | a NOUNS Eee BS). 


Ro ONGy eS bor 


Hr Tbe 


mw 


eR we We = bo 
bo 


bo 


14 21 


TABLE 60. 


Glasgow. Height and Weight of Girls 


36 45 62 64 58 70 57 60 25 18 15 6 


8 


of Group C. 


er bd Ww Ww 


me OCR SE bo 


| 
| 


= 


Il 
2 
2 
6 
7 
8 
2 


bo 


= 
WDOONWN WwW 


| =| 


Oe 


KH bw Boe 


ee 
KH POOWr OBE 


crane | ree 


— 


Ot en) 


Ros Ee or OUD 


ail era 


Hep Roe 


Totals 


3.8 


10 


83 75 54 44 


34 21 18 12 


Biometrika x 


334 Height and Weight of School Children in Glasgow 


TABLE 61. Glasgow. Height and Weight of Girls of Group C. 


Weight of Girls of 11°5—12°5 years 
| Totals 
| 

iy SS SS ES 1 

43 ” re a i mae ae a ae coe a ee Sa Tee 
| 44 ” are 1 1 ae i 3. oh ee 2 
[ie || pee Se ee ee Se 4 
[i Skil ee oD 1 — — — — — 7 
Pyare (eee ray 26 9a a = <=seas 
| YSin 2 == (2° 4" 8B baal Ss SS SS SS 32 
pee Pe eA TO i. Sh 7 Beg | 
VrO es jhe al Lh TNS IPS BY) ee 71 

51 sella =, S20) One dome seo wal = = 71 

62.;. (=== = “197 21 14,00 165, Bes 2 96 

5S, a = 07 15 610 10M On 2s) ie 79 

bh. Na 92 5 10,11 1696 6 48 

Do tlh ae a aoa 13 510198 6 eS 44 

56 a 2 6 664.3 1079.9 222 2 See ee 

Bi Ki eee Se ee ae 1 a3 12 (or ee ee ee eee 1 9 
[aoe 3.21 3 ————— — 9 

59 a a Se ee ee 3 

60 ,, l 

ol ,, 

G2 ” 2 
| 63 ” a OT 
64 99. Ee ens a ee ae Sage ag A ne ral Y 1 

Totals] 3 9 14 39 66 83 74 68 62 46 41 28 23 12 10 595 

TABLE 62. Glasgow. Height and Weight of Girls of Group C. 
Weight of Girls of 12°5—13°5 years 
Heo g iat | ole heh Sel gel call alan ein ae a ea (sheet eal eiberae 
2 PW WW 1 |} 8 |G WW 1 1 1H 1B 1G 1H 18" 6 1D 19 
SSSBERESSSRESSSSSESSSSS 
38 | — 
3d), 
40 ,, 
| 46 ” 
47 5 
48 5, — 
49 5, 2 2 
205; 4 =e 
AL ,, 14 1 = 
52 5, 7 1 1 
o3 ” 17 4 3 2 
Sh, 24 18 13 a 
55 4, 919 11 6h al 
56 ,, 2 5 9 4 
57 ,, 2 10 10 12 6 
58 ,, 4 7 8 
59 5, — 1 5 -1 
CL; 2 
| 62 ” 
63 ” 
| Totals 1 2 5 10 29 58 57 81 84 55 48 462515 9 7 4 5 2 1 544 


EK. M. ELpERTON 335 


TABLE 63. Glasgow. Height and Weight of Girls of Group C. 


Weight of Girls of 13°5—14°5 years 


Totals, 


| | 
LD wD 
a 35 
3 S 


| 


Th eee Sh AE yet ee Ee ee 1 
ae = Nl 
ee a nee 3 
eee ohh OeeaeeCe ee 2 
ike deo ee 2 oe 6 
5 I Ee Dt eee le eR 1a Lt 
le 934 I 1 8 
ee GO oe pao eae” ee ea | 
ee Oe eG Oe oy aS) et) og] 
— = 1 I= 6 5 4.2 —.2 2 — — — | 93) 
es ee ae a on ee ee ee 
eo oy Gl 28 ees es 8 
ee eae Ore et. De ede ee 17 
ase yet 1 | Oca pay eae gn 7. | 
me a 1 OepeeaiG dae a 5 | 
eae: prs 


TABLE 64. Glasgow Height and Weight of Girls of Group D. 


Weight of Girls of 5°5—6°5 years 

Height 

u al d a al wD il il al n ! 1 | il | il i 

ee oe eS ees 
eS ai Ey ae Reet 
35a as) ST ee ee ee ae 
36, Soe I coe Be ee a a 
37, Sole yal mee Ay eens Uh eet eee ee lee 
88.5 Dee ee ihn see Sea Se Se a ee tes 
89 4, iD ee Ae ee Big Oa aE idoh a ne Seca tas im [oe area gee Oe a 
40 ,, Se ge! oe OY he E Mase ea elig) -2 2 Geen ren ee ef Soe 
WL, - - el (ye ke XS 9 4 1 — 
Dak ee ele i OOM Ome Geclibmee Os fe oe. Me eee 
ee ae ee ea) oe Ge ee a 
4h 45 ee l Aye Ae Oh eg A LL —-— — — 
45 ,, STS eal a ae or ee rea Seo 3 4 6 3 ——= 2 — = 
46 ,, = eS ee 
Ll, es [oe MS VE 


336 


TABLE 


Glasgow. Height 


Height and Weight of School Children in Glasgow 


and Weight of Girls of Group D. 


Weight of Girls of 6°5—7°5 years 

Height | | | | | | | | | | | | [eee"| Lotals |’ 

wD Uw, UW wD iy LD LD WD LD wD id iD wD uw WD 

HR Sk SSS SESE EE EB 
37 EF 1 I — — 1 SS SS SS 3 
38 ,, 1 2 = SS = 3 
oo), —- — 2 4 2 3 2 —- ~ ~ ~—~ ~ ~—~ — — 13 
40 ,, — 4A A Gp ed — = = 16 
ilies eH 5 US er Sh 8} i 42 
vi. i Ope 4 209 20 ei a3 eas Wo 74 
Ide Vo = Se Da 298 0, So 5 ee 
44 ,, —- — — 1 She Mil ily y aifshs Igy (S) 1 Bo = = 76 
ote, — Oe San l4 Seli9) tel By A ee 66 
UB a. —- —- —- — 1 1 2 inl Oa tT Go 2B. Ae ll 1 40 
Ts = a - 1 1 A Oe Ogee Ong 2 1 26 
AS ,, -— 1 a I 6 
Os - — — SSS = 
50 ,, co a ee ee a : 


TABLE 


Glasgow. 


Height 


and Weight of Girls 


Weight of Girls 


of 7°5—8:5 years 


WwW “WD i 
9S OW 
3 9 8 


i ea 


Wy) SD ) D 
tS 39 9 ion D> 
= => = > > 


1 | ’ | al it uw? il al il | JL 
Sie AS eh ial 9 ™ 
iD KD D 9 2D © S 


1 
le Oe 4 

} E29 ee a 6 

Bet ID 2a des Cia nO, tee ge en pee ee eee 22 
ee ee ee ee ee ee eee i eee eee | 
nae ee. “OO eS eae ee te ee = = 34 
a SD 24 1G; 9) 0 5 eee 
ae ee ne ime Oey Ime Gs Oe) 1 — | 8 
ee ee Pe NN sy Oph pe go live 
ae ee SD A 75 162 1G ey ee 
= 1 =) 8 7 59°99 38 64° enn 
ey Be eS ee | 
a ee eee Oa eee Ss i 

en ee 2 

iss 2 


E. M. ELpErtToN 337 


TABLE 67. Glasgow. Height and Weight of Girls of Group D. 


Weight of Girls of 8°5—9°5 years 


Benet ed fh. all do periisetly salerale ols alae | | | | Totals 
| lp 18) 1p 1S 1S th 1s 1s 1 As tp 1B 1s 16 1 tH 1H 1H 1G 

Pap OS EO ESO eer eOp eT: Or NS) 2S) OX st SD, AQ eS SS) Sf So) La eS Gy 

Se SON west est estes stg CY, UG Ry GS LG ECS) “RO GOs KO AO! t= RS RS GAS, BS CD 

1 1 

= 1 

1 De ee 

= 2165 : 

a 56 4 
ee hes 17) 19013) 8-8 ae ae = | 59 
ope 3 13 1819 Gar gaat Ee ee 70 
VN et 1 POG HIG WT 1Oje7 —— 2) VA a Ti 
Vee eT 8 1813. 17. 84 3 5 8 Se 73 
HO) | | gig ie an aera WaOb en dome, Ord At ee 69 
| Sea oMlOL bo ya O64 a Te ed Ag 
S| (ti em esl el ee Dron a: Ae el ee re 15 
Ome ee eave alle ele wi af ee ME SD 
5 — ee (OSes ae = Se ee ee 1 6 
ee a ee De Aiba [yee ee ice eee 4 
Oe |e ner ee ee ee ee Sk oe 1 


TABLE 68. Glasgow. Height and Weight of Girls of Group D. 


Weight of Girls of 9°5—10°5 years 
Height eee yd potas 
WW UD My AQ AQ WO ID ID WD ID ID W ID iD 
iS > 1 
e) a) a) 
41§—| — 1 — — SS SS 1 
42, | 2 — i Sse a _ 3 
eal he = = ee ener ae 
Abby a. lle at UGens Hote a Lees — — — 9 
45 5, — 5 2213 1—-—— 2Z— — — — —- — — — 16 
HS 5 aes UALS ey OSE, Ssh ots}e a} ee — — SS 44 
Ula eA AG Ged no, ook 3 SS 68 
48 ,, —— 4 4 612 919 15 4 3 2—— ~— — — — — — — — — — 78 
Loe. — — —— 4 713 21131714 6 4 2 — 102 
50.,, | — — — — — Gree As opal SA 7, — il — — 92 
él, J|————— — = IG Gi 18 2 64 
52 45 fF — — — — — — — — é 2 8 62 21—-—1%1—— 42 
2.3 213 —-—— — — — 1 18 
— 2 By i 6 k= 12 
| — al — 2 — 6 
eS ee ee eee ee ee See) Se 1 
24 30 49 61 57 52 560 


338 


Height and Weight of School Children in Glasgow 


TABLE 69. Glasgow. Height and Weight of Girls of Group D. 
Weight of Girls of 10°5—11°5 years 
wel Ca 
yor lon = ame ee Ss ee ee eee 
eg |. eae a AS ee ae 
aay Wes, Nor Me Mites 5 2 Wa re 2 
La) —— 1 2 Lm 4 
Je. | 3.19 2 1 oe SS SS SS eee 11 
Ue = 0 96. ae gel ee oe =o Sees 
4g, [ee Ll -4 510 4827053 1 ie eee 39 
Or ———— 1 9141614 5 5 38 21 1’ — — — — — — — 72 
20. ;, —— 1 —— 4 7 8141015 7 1 131 1TH— 1 ~— — — — — — 74 
41, J — — — — — 1 6 6 412141011 4 5— 2 2 — — — — — — — — 77 
Gy ie Nee See 1.413 8 8 810 °3 2°20 = oo = ee 
Bore oo a a ee ae 8 3.77 3059" oe Gael eee 55 
TS Raien| Pee, fon, ey Le 391-347. 5 29'9 4 3 oe eee 
Lem (enema ee a ee, ee at Sa 3156 1. 3.49) 2 1 18 
Cauca (eee seer ee Say, hat Q: 4. = SE ae 8 ie a Ss ee eee 
Td Ws eae es Se eS ee i 2 
a ee ihe 4 
Se eS a 1 
Totals} 1 2 9 10 15 24 44 43 44 54 47 43 34 35 30 1418138 4 5 3 2 1 2 — 1 | 498 
TABLE 70. Glasgow. Height and Weight of Girls of Group D. 
Weight of Girls of 11°5—12°5 years 
Height Totals 
a a a a a 
eh Ss fee Sa ee ce ee 
6i— a 4 
eee 2 i 15 
48 ” 4 = = ll 18 
on 7-5 11 el 36 
50 ,, a 12 1 ied 9.02. 55 
areas 8 514 Ose 78 
52 ,. = 8 By a) 9S) 1 63 : 
58 == 9 ee) 78 
ob 5; — 5 21-17-15 4 95 
55 —— 1 45/6; 72 5 10 53 
56. eee IGS Be Sib} 47 
evans eG 4 27 
aoa ee 4 11 
59 ,, ee al i 
nial) 5 
6. 1 
62 ” The 
68: ,; 1 
O4 ” Ti 
Paes zs 
| Totals} 4 15 30 42 61 77 76 65 62 47 26 38 594 


E. M. ELpERTON 339 


TABLE 71. Glasgow. Height and Weight of Girls of Group D. 


Weight of Girls of 12°5—13°5 years 


i Vege ee ee ee ee ee 6 
- By a hy hs Se a a 16 
., 3) cit heey Weare = —— 9 
.; 8 9 FT 4— 2 2—- —- — — — ~— — — 33 
» Creo e Say 68a 4 |, 37 
5) 2 10 14 13 8 2 Loy’ ~~ ~~ ~—~ ~— — 51 
» — 6 4 19 1 13 6 1 3— ~ ~ ~— — — 67 
3 1 1 38 11 22 9 10 4 4 ~—~ —~ —~ 1 ~— — 66 
> —-— 1 6 18 11 14 9 8 1 1%1—-— JI 71 
” — — — 38 383 12 20 12 1446 6 2 1—— I 76 
” — — — — 4 7 4711 1 2 6 1m i — 46 
” — — — 1 2 38 4 7 56 56 22 1 — — 32 
5 Se ee ee a en ee Ee 14 
Mt ss —_- — — — 3 2 38 1.1— — 10 
52, fe he ee ae i = 2 
63" ,, oS ee a — aH 1 1 —_ 2 
— SS 
Totals 538 
TABLE 72. Glasgow. Height and Weight of Girls of Group D. 
Weight of Girls of 13°5—14°5 years 
TEESE PE alba rlPs P aeoca 
ID ID WH 1D 1D 1D ID ID 1H ID 1S ID 1H 1H HH 1H 1H WH | 19 
AMA HH BDH ~ HH DH AK HTH DDB HF > 
Sig bb © S Bee a Cope CON OO COMES OG! A a Se SE 9 
i a rr SS ar a SSS SSS SS eS 1 
op een me See 3 
Pag me pace SUR aA SMe ye ey yt Ne ere ae ey See 2 
— — — — — J ~— ~~ — — — 1 
ee ae eS 10 
io Nee as 10 
OMe ees ee ete 32 
ag eee et 35 
Gare) fee = =e a ee 45 
ee 15 51 
= eee ees 10 a bees eal Ar 
= a 5 52 
pe re eo See 3 39 
ee 1 eee as ee ee | 0) 
es 2S a a 1 in 
Se a ee a 7 
2 oe ee Se ee, eg Se ee 1 
aa pe eee 2 
Totals} 1 1 3 6 7 19 43 33 41 47 50 24 30 20111310 5 2 2 — 1 369 


NUMERICAL ILLUSTRATIONS OF THE VARIATE 
DIFFERENCE CORRELATION METHOD. 


By BEATRICE M. CAVE anp KARL PEARSON, F.RBS. 


In 1904 Miss F. E. Cave in a memoir on the correlation of barometric heights, 
published in the R. S. Proc. Vol. Lxxiv. pp. 407 et seq., endeavoured to get rid of 
seasonal change by correlating first differences of daily readings at two stations. 
A similar method was used by Mr R. H. Hooker in a paper published some time 
later in the Journal of the Royal Statistical Society, Vol. LXVII. pp. 396 et seq., 
1905. This method was generalised by “Student” in the last number of 
Biometrika (Vol. X. pp. 179, 180). He showed that if there were two variates 
x and y, such that 

x= (t)t+ X, 
y=fO)+ ¥, 
where X and Y are the parts of # and y independent of the time ¢, then the 
spurious correlation arising from a and y being both functions of the time could 
be got rid of by correlating the ditferences of # and y, and that ultimately, when 
m is sufficiently large : 
Tamg Amy = VamHyamty = etc. = 1yy, 


so that the correlation of 2 and y, free from the spurious time (or it might be 
position) correlation, Le. ryy, could be found by correlating the successive differ- 
ences of « and y. When the correlations of the differences remain steady for 
several successive values, then we may reasonably suppose that we have reached 
the correlation 7yy*. 

This method is still further developed by Dr Anderson of Petrograd, who in 
a valuable memoir published in this Jowrnal has provided the probable errors of 
the successive ditference correlations of a system of variables : 


xX; AGS O05) XG 
Vis Vay OO) NG. 


* Having been in communication with ‘‘ Student,” while he was writing his paper, I know that the 
interpretation put by Dr Anderson (Biometrika, Vol. x. p. 279) on ‘‘Student’s” words (Ibid. p. 180) is 
incorrect. ‘‘Student”? had in mind, if he did not clearly express it, the ultimate steadiness of 


. . c 7 
T amy Amy for a succession of values of m. K. P. 


Bratrick M. Cave and Karu PEARSON 341 


where the correlations of random pairs of values of the variates, or the product sums, 
S{(X,— X)(X,—-X)}, 


SHCis= OrOe = 9a)e 
are both zero. 


Dr Anderson has further provided us with the values of the standard deviations 
of the successive differences, Le. 


aud ¢ 


Tamx Amy? 


which represent the ultimate values of o,,,, and o Amy? when we have carried m so 
far that the time effect has been eliminated. 


The new method appears to be one of very great importance, and like many 
new methods it has been developed in a co-operative manner, which is a good 
reason for not entitling it by the name of any single contributor. We prefer to 
term it the Variate Difference Correlation Method. 


With the exception of a few illustrations given by “Student,” no numerical 
work on the correlation of the higher differences has yet been attempted. It is 
clear that much numerical work will have to be undertaken before we can feel 
complete confidence in our knowledge of the range and of the limitations of the 
new method. We have yet to ascertain how far in different types of material 
a real stability of difference correlations is ultimately reached, and how far 
various assumptions made in the course of the fundamental demonstration apply 
in dealing practically with actual statistical data. One of the most important 
assumptions made if there be n values of the variates is that arising from the 
reduction in the number of values as we take the means which occur in successive 
differences, and a like assumption is made in the case of standard deviations. 
Thus for example: 


Ue ede © ee 
{8(X) =X, 


n-1 
but = S(X 5 — X54) = (X1— Xn) /(n— 1), and will not be sensibly zero, although 
Te ke cal 


it is assumed to be, unless n be very large. Similar remarks apply to the sums used 


‘ erat 2 : ey ee 1 n a = 
in the standard deviations, i.e. we assume in the proof S i) = oar js (Xe a) 
eal coe float 


Ultimately with the mth differences we come in the proof to relations of the 


type 
1 


NES iit 


3 my g (X 5) 
1 Ny 


and 
nM = 1 id Van) 
re - § (X,). 
il 


nm—-m 4 ? 


Biometrika x 44 


342 Illustrations of the Variate Difference Correlation Method 


Now such relations will undoubtedly be very approximately true, if the X’s 
are random variates uncorrelated to each other, and provided m is small compared 
with x. These conditions seem amply satisfied when we proceed to fourth or sixth 
differences in barometric pressures, taken, say, over ten or twelve years; the 
addition of four or five daily pressures will hardly affect sensibly either the mean 
or the standard deviation. But such extensive data, while not only involving a great 
deal of labour in the difference work* are not those which, perhaps, most frequently 
demand the attention of the statistician, whether he be economist, sociologist 
or a student of scientific agriculture. In such cases it not infrequently happens 
that the available data only provide a range of 20 to, perhaps, at most 50 years ; 
and we need to discover whether there is a true relationship between our variates, 
apart from a continuous change in both due to the time factor. At present 
accurate statistics of annual trade or revenue, or satisfactory annual demographic 
data hardly extend at most beyond a period of 50 years. Very often—under 
even approximately like methods of record—we shall hardly have more than 
twenty years’ trustworthy returns. Not only has the method of record been 
changed, but the conditions of transit and trade may have been immensely 
modified and in a manner which we could not suppose to be even approximately 
represented by a continuous function of the time. 


The object of the present paper is to illustrate the theory of the variate 
difference correlation method in its present stage of development on a short series 
of economic data, in order to test what approximation there is in such short series 
to stability, and further how nearly Dr Anderson’s values for the successive 
standard deviations apply to such cases. We have selected as our data ten 
economic indices of Italian prosperity for the years 1885 to 1912, together with a 
“Synthetic Index,” formed by taking the arithmetic mean of the ten economic 
indices referred to. These eleven indices are given by Professor Georgio Mortara 
in an interesting memoir: “Sintomi statistic1 delle condizione economiche 
qd’ Italia” which was published in the Giornale degli Economisti e Rivista di 
Statistica, for February, 1914, and form Tabella I, of that memoir, which we 
here reproduce in part as Table I. The indices in each case are obtained by 
dividing the returns for any year by the means of the returns for the years 
1901—05, inclusive, and multiplying as usual by 100. 


The indices are for returns of (1) Gross Receipts of Railways, (i1) Shipping, 
loaded and unloaded at the ports, (i11) Effective Revenue of the State, (iv) Inter- 
national Commerce, Value of Imports and Exports, (v) Number of postal 
Letters and private Telegrams, (v1) Amount of Stamp Duties, (vii) Savings Banks’ 
Returns, (viii) Impo: tation of Coal, (ix) Gross returns of consumption of Tobacco, 
(x) Returns of Coffee imported. Professor Mortara has drawn attention to the 
very high correlations of these individual indices with each other, and of each of 
them with the “Synthetic Index.” The latter correlation is, however, to a certain 


* A discussion of the correlations of the higher differences in barometric pressures will we hope be 
shortly issued. 


Ce a 


Beatrick M. Cave AND Karu PEARSON 343 


extent spurious. For if J,, Z;,... J be the individual indices and J, the synthetic 
index, then J,=75(44+ 4+ ... + Ji) and any individual index J,, if there be no 
correlation between such individual indices, would give 


1 a 7 T i! T \2 
7b Us — Ls) La — La) = z9,, 8 Fa — 40) 


I 
9. 
a 


2 I Ti 1 2 2 2 
Ca poss SU; —I; = 100 (Cie Ose eed) 


and accordingly 
1 72 
my 


; Wie 2 : 
Cae toe UG ea aac 


eee) 


co 
Tho 


TABLE I. 


Professor Mortara’s Table of Index Values of Italy. 
Numerical Index (Mean 1901—1905) = 100. 


| 


A | “a » vw | | | S.5 

Paani eee Soul ste Vee eta | ia | 8 | eg | 2 eae 
Year z a ® 3 & oa} as ax Ss ois 2 Opes ae 

eee eo acess! Sees e | O ) Ss |S, | ee ee 

a B a 20 aS ES) | ess a BA 4g | 

} | 
1885 61 |-- 63 78 72 | 38 82 47 53 82 98 67'4 
1886 62 63 |. 79 | 74 40 87 53 52 86 94 69-0 
1887 68 73 | 82 78 42 94 BY 64 88 91 73°5 
1888 Omg ples” 83 =)" 162 43 98 57 69 86 81 72:0 
1889 71 Me | 285 70 45 99 58—| 71 86 78 74:0 
1890 (2 78 | 86 66 46 97 59 | 78 87 81 75°0 
feomaneer2 72 | 85 \| 60 |: 48 | 96 | 60 | vo | 88 | 80 73°1 
1892 lye 7 86 | 64 bl 96 64 69 89 80 745 
1893 "On 970 8 | 64 | 55 95 65 66 90,5], 73 Toes 
1894 72 a 72, 86 | 63 57 93 65 84 89 71 75'2 
1895 73 76 | 89 | 66 60 92 68 a 88 70 759 | 
1896 75 ves 90 | 67 64 94 70 rp So ee Tec 
1897 78 80 90 | 68 | 68 96 73e |e 76 Site (ro 79:1 
1898 81 84 STE aan 183 96 75 | 79 | 89 78 Sp | 
1899 86 88 | 93 | 88 75 96 70. iN 87 \ar91 82 | 86°5 
1900 89 89 | 94 | 91 78 96 83 | 88 92 82 | 88:2 
NOON |) 90" <I, Ol" “96. 1h 92. | 852 || (96."| 87.4 86 | 95 | 92 91-0 
1902 967 |) 99" | 198 95 | 93° | 97 92 | 96 | 97 | 94 95°7 
1903)" | 10 | 103 | 99° | 99 | 103 | 99 99 99 | 99 | 102 | 100°3 
1904 | 105 |-102 | 101 | 103 | 111 | 101 | 107 | 105 | 102 | 103 104°0 
WoOpsaiel0s | 105" 105 | 10k | 0S" i07 | tts | 114 | 106 | 109 108°8 
1906 | 119 | 123 | 108 | 132 | 107 | 115 | 127 | 137 | 109 | 118 119°5 | 
1907 | 126 | 125 | 108 | 144 | 114 | 120 | 144 | 148 |,115 | 125 126°9 
1908 | 134 | 129 | 113. | 139 | 122 | 120 | 155° | 150. | 124 | 132 131°8 
NOOR 139) 40 121 } 149° 132 193, 168 | 165° | 131 | 140 140°8 
1910 | 146 | 146 | 129 | 159 | 140 | 130 | 180 | 166 | 138 | 147 148°1 
HOU \al56 156: | 135 | 167" 1149 \136 [18% | W71 | 144. | 153 155°3 
1912 | 165 | 169 | 188 | 179 | 158 | 141 | 192 | 179 | 151 | 160 163-2 
{ | | | | 

Mean | 94°8 | 116°3| 97°6 | 96°4 | 82°3 | 103°3 | 95:9 | 99-0 /100°6  98°6 96°5 


344 /llustrations of the Variate Difference Correlation Method 


which would have a substantial, but spurious value amounting to ‘316 if 
Cp Op Gy If there be high correlation between the individual indices, of 


course, the correlation of each individual index with the arithmetic mean or 
synthetic index will also be high. Thus our Table IV shows it to range from ‘952 
in the case of Coffee to ‘998 in the case of Railways. Possibly a third of this 
correlation may in some cases have a spurious origin. But the individual indices 
are very highly correlated together; only two such correlations are below ‘9, and 
the lower of these two is as high as ‘885. We are accordingly left with fifty 
correlations ranging from ‘885 to ‘997 between the individual indices, and if we 
accept these as true measures, then it is clear that any one of these ten indices 
might be used as a reasonable index of Italian prosperity; it would for practical 
purposes be idle to calculate them all or to table their arithmetic mean. 


But the high correlations found lay themselves open from our present stand- 
point to some suspicion of being solely due to the fact that during the 28 years 
under consideration Italy has progressively increased in population and_ac- 
cordingly the consumption of innumerable goods and the means of interchange 
have all grown together with the time. In other words the correlations we give 
under the heading of “quantities” in each separate section of Table IV are very 
high solely because the individual indices are variates increasing one and all as 
continuous functions of the time. 


The material therefore seems especially suited to the application of the 
variate difference correlation method. For example, the correlation between the 
indices for tobacco and savings is ‘984; are we to interpret this to signify, that, 
if there are large savings this means that much will be spent on tobacco? Or, 
is this high correlation simply in whole or part spurious, merely indicating that 
both savings and consumption of tobacco increased markedly with the time ? 
Actually the correlation of first differences drops from ‘984 to ‘766, that of 
second differences is negative if insensible, while from there onwards it steadily 
increases negatively, till with the sixth differences we reach —‘431, which seems 
to indicate that, when time has been eliminated, expenditure on tobacco in any 
year means less money saved. Again the coffee and tobacco indices appear very 
highly correlated, ‘955, but by the third ditference correlation we have reached 
about a third of the relationship, ‘319, which is scarcely altered in the sixth 
difference correlation, 326; we may assume therefore that there is probably a 
moderate “organic” relationship between the expenditures on coffee and tobacco, 
but the association is nothing like as close as would be suggested by the corre- 
lation of the raw indices. 


The work has been done in the following manner: The successive differences 
of the indices up to the sixth were found. The means and standard deviations 
of these differences were calculated, and the correlations were then worked out 
in the product-moment manner. This involved the very laborious work of 
determining 385 coefticients, and then to these coefficients were added the 


Beatrice M. Cave AND Karu PEARSON 345 


probable errors as found by Dr Anderson’s formulae. These probable errors are 
of course those of the correlations of A™X and A”Y, and will not be the correct 
values for the probable errors of the correlations of A™# and A™y, until A" = A”"X 
and Amy =A”Y, ie. until m is sufficiently great for ¢ to have been eliminated. 


’ 


Further their accuracy depends on the vanishing of the means of the differences 
or on the equalities of the sums like 
1 


w= n 
S Xe) ah | . (XG), ete. 


1 
n—1l 


which, while true on the average, will only be approximately true in the actual 


instance if n be large. We give in Table II, the Mean Values of Index 


\ 


Differences. 
TABLE II. 
Mean Values of Indices and their Differences. 

S| = oF ine val, ‘1 7 
| 2 bo ES | i hes (3s $8 
ees | 8 ok Ss 2. “ip 2 P ey 3p fet 
ee ee eee ase 2 | 3s i.e |eee8 
seles les Gales |ae | sa) °.| Ss | 8 |e 88 

ee fs Et et | | @ <5 

us zal : : 

Quantity ... | 94°8 | 116°3| 97°6 | 96:4 | 82°3 | 103-3 | 95-9 | 99-0 100°6 | 98°6 96°5 
Ist difference |— 3-85 |— 3-93 | — 2:22 | 3-96 |— 4-44 | — 2°19 | —5:37 | — 4°67 | — 2°56 | — 2°30 — 3°55 
prides 5, — 35/— 50|— 08|- -38/— 27) -00/+ 04/— -35|- ‘12/- -42) - -24 
3rd, + 16/+ -28)4+ -20}— 08] -00/+ 12/-— -08|+ -40)/— 12) ‘00; + ‘09 
4th 3 — 33/-— 88/-— ‘13}-1:17 00)/— -28)/+ -04]/-— ‘79|- ‘16;-— 42} — ‘40 
5th ” + 67/4226 + -26/+2°52/4 30 00 |— °52)/41°87)+ 22/4 87) + °83 
6th 5 SCD Ne 2L00 F008) 735) — 05 |4+2°00;-— °95)-168) -—1°10 


It will be seen from this table that the means of the differences are far from 
zero even when we have reached a difference for which we may suppose the time 
to have been eliminated. This arises from the smallness of the series dealt with 
and shows us that we ought not to anticipate more than a rough accordance with 
theory, or only an approximate steadiness, for sums like 


may grow less and less steady as m increases, 


Similar considerations apply to the standard deviations of the differences. 
These will not at first obey Dr Anderson’s formulae because they are the values 


for om, and T amy? and when we have taken m sufficiently high for 


and o i to be theoretically equal to c,,,, and o,n5, 


Tamy Am 


346 Illustrations of the Variate Difference Correlation Method 


and accordingly the correlation should have begun to be steady, then some failure 
to obey Dr Anderson’s formulae will arise, because the means of the differences 
are not truly zero and equalities of the type 


el n 
n—-1 


Use 


=i 1 1 
8 Ce) — ar s (X2;.5)) ete 
will not be satisfied when n is relatively small. 


Dr Anderson gives the value of omy I terms of o°,, but we do not of course 
know o?,, which will be very different from o?,, and can only practically be found 


from the value of o? itself, after that value has become equal to Oo my» Le. after 


Ar 
steadiness has set in. In order therefore to test the formulae we have formed the 
ratio of 

O AmylF jm, from m=1 to 6. 


: Dies : ; - 
This equals seater in Dr Anderson’s formulae for Can Cention and therefore 


we have a good measure of the approach of Aw to AX, or of the growth of 
steadiness as apart from the correlations. The following Table III gives the 
values of the ratios of the squares of the standard deviations, theoretical values, 
actual values and the mean value for each of the differences of the ten 
individual indices. 


TABLE III. 
; 2 
Values of o?4my/O%4m-1y and their approach to mer ; 
S234 | fa Mean of 10 
§ se 3 , | oS! ¢ 8 2 ewe a 8 & Index 
lm | 2-2 a8 e a S gq a £2 g 3 2 BSI Difference 
3 8 aa on] ate | ag S = s Pe S 2 = Standard 
rch ean a en) 25 a n a Deviation 
= & Ratios 
1 012) °012 031] -019) °038} -009]) *040) -O10} °035 | °022) -036 | 025 
2 | 3 705 | °708 | 1°834| °763|1°720| -799 | °585 350 | 2°074| °352| °843 1:003 
3 | 3°333 | 37107 | 2°816 | 3093 | 2°124 | 3°032 | 1°959 | 1°660 | 2-214 | 3°075 | 2-213 | 3°307 2°549 
4 |3°5 3°167 | 3°128 | 3°174 | 2°747 | 3°213 | 2°597 | 2:008 | 3°106 | 3°379 | 3:025 | 3-619 3°000 
5 | 3°6 3143 | 3°449 | 3°189 | 3:020 | 3°104 | 3°010 | 2°328 | 3°275 | 3°580 | 3-117 | 3°701 3°177 
6 | 3°667 | 3°149 | 3°711 | 3°195 | 3:164 | 2°881 | 3:208 | 2°499 | 3°455 | 3°682 | 3°101 | 3°791 3°269 


It will be clear that until we reach the ratio of the square standard deviations 
of the third and second differences, there is no general approach to steadiness. 
After m=8, however, for m= 4, 5 and 6, the ratio of the values for the mean of the 
series of individual indices to the theoretical value is ‘86, ‘88 and ‘89, respectively. 


Thus, there is increasing approach to agreement in the observed and theoretical 
values, but the approach is slow, and we believe that there is greater steadiness than 
is really indicated by this test. The source of this apparent unsteadiness lies we 


BrEATRICE M. Cave AND Kart PRARSON 347 


think in the relative largeness of m compared with n (Le. at a maximum 6 as 
compared with 28), rather than in our not having taken sufficiently high 
differences. 


We now turn to the correlations. These are given in Table IV, the actual 
values of the standard deviations of the quantities and their differences being 
recorded along the diagonal cells, while the other cells contain the correlations of 
each pair of variates and of their successive differences. 


We will now consider these correlations in detail. 
(a) Synthetic Index (Arithmetic Mean) with Individual Indices. 


We see at once that the Synthetic Index is highly associated with Shipping 
(>°85), with Importation of Coal (? >°75), and International Commerce (? < ‘68), 
and fairly highly with Revenue (c. ‘55). On the other hand the sixth difference 
correlations with Post (c. 15), with Stamp Duties (c. ‘24) and with Savings (> °23) 
are all such (i) that they might well have arisen from the spurious element in the 
Synthetic Index correlations, and are all less than their Andersonian (steady 
value) probable errors. Almost the same may be said of the Railway Index ; it 
is not beyond suspicion of being spurious, and is scarcely significant having regard 
to its probable error. The Consumption of Coffee is also not very closely associated 
with the Synthetic Index ; it is only about twice its probable error (427 +°205), 
and a good deal of its value may be spurious. Further in the case of both 
Coffee and Railways, the correlations are still falling between ‘04 and ‘05 for each 
difference. The last individual index remaining is that for Consumption of 
Tobacco and although the correlation of sixth differences is not really significant 
it is negative (— ‘247 + 235), and is exhibiting a steady negative rise. 

Stripped therefore of the common time factor the Synthetic Index will be seen 
to be no very appropriate measure of trade, business activity, and spare money for 
savings and luxuries. With Post, Stamp Duties and Savings, it has probably only 
a spurious relationship, expenditure on railways has little influence, that on 
luxuries is very slightly significant, or indeed in the case of tobacco negative. It 
is, however, closely related to variations in external trade, i.e. imports (including 
coal), to exports and shipping and to effective revenue. It appears to us that a 
suitable general index of prosperity, which will distinguish between a continuous 
growth in all factors with the time, and favourable and unfavourable fluctuations 
from this growth, can only be obtained, when there has been far more ample 
study of the associations of individual indices among themselves, and of these 
indices after they have been freed from the time factor, i.e. of associations 
between high difference correlations. From this standpoint the study of General 
Index theories is at present in its infancy. 


(6) Railway Index, This index is very noteworthy in the nature of its 
associations after removal of the time factor. We have reached a steady 
correlation (c. 62) with Shipping, but beyond this no values of first class im- 
portance appear. The relation of Railways and Revenue after falling practically 


ce Correlation Method 


ereé) 


te Diff 


1a 


ions of the Vari 


Tllustrat 


348 


Weg toe 


OO PS Ao eS 


GVB-+ZPl- | OGG. FIO. | EPS. +TLT- | 16.49L3- | 961-FC9F. | FOL. FoLr. | 68.68 | 6ez.FPIZ. | LL1.FUrG- | 0¢z. ¥L00.—| cet. FOg9. . ¢ 
VEG. FEES. | CES- FATT. | QBS. FECT. | GZ. FIGS. | L81-FSEr. | L6I1-Fcor- | 9.0 | PIS. 80K. FL EPO, cee. 4 110. GOL. E FPL z aie E 
661-+ POE. | GTS. +FIGT- | 80. FEES. | 90. +6PZ- | ELI-FI9P- | L6I-FLIE. | ZL-93 G6I- +108. | PEI-+199- | FIZ-F8GT- | $80. + LAL. “uy | & 
GLI-FOPE. | OOS FLIL. | L8I-FELZ. | B8I-F99Z- | E9T-ESHF- | 961. FRI. 20-91 GLI. FHPE. | GOL. F289. | FET. #10. | TL0- £908. ple to 
I9L-+GPE. | G8I-FOLO- | 991-+00€- | OLT-FG9Z-. | SGT. FOTP- | Zel-FLz0.-| 08-6 ELI F6ZE- | LOL. F199. | ELT-F9ZZ- | G90. F FOB. epuaae) 
GOI F68G- | QBI-FEEP. | OBIL-FEPF- | OLT-F8IG. | IGT-FI8P- | ect. F9ET. 20-2 9&1. F69E- | 6L0-FCOL | GOI-FI19¢- | 090. F68L- | Bla ast] = 
800: +696- | 800-+196- | 800-+0L6- | G00-+6L6- | C10-+1P6- | S10. F6FG. | €8-G¢ OLO- #Z96- | GOV. F186. | G00. FZ86- | GOO. F8RE- sorqiquune 2 
ie) 
o) 
O1B-FOOF. | LEZ GLI. | GEZ-FOLZ- | FFE. FFGT. —| FES. F GCS. —| E1Z. F ERE. —| GEz. F FIZ. 8L-FT €OL- +991. | 90% +EF- | LLT. # LPG. “ 499 
FEL-F6IF. | LEZ-+FPEL- | GES. F6FS- | 63E- FIST. —| O€Z- F89L- —| GOT. FFELE. —| F1Z- F ROE: Té-8 10l- #992. | 10% F18e- | E91 F FEC. “  yag 
6LI-F8GF- | LOG. FEFZ- | L0B-FOFS- | OLG- F L1Z- —| BIS- FF80- —| BRI. F 18. —| ET. F LEE. 8L-F ZOL- FEL. | O6T-F6ZE. | FST. FPG. «up| & 
GLI-FO06E- | E61-FOIZ- | BEI. FBBZ- | OGT- FLFZ- —| ZOT-F ZOO. | ELT. F18e.— | QL. FPEE. 88-Z IIT FEL9. | O61. FFG | IGT. FF6P- “ pig] § 
OLI- +192. | 181. ZOI- | SLT. FOG. | 641. FPFI-—| ZBI. F090. | G91. 29z- —| ELL. F 6S: 86-1 FEL EPIC. | Bl FPLO. | SCL. FOLE. Hite | 
OGI.+16F- | 880-+999- | 9FI-+6L6- | GLI. + 1FE- EL. + [LP- | LET-+99€. | 981. F69E- 1Z-% LOL: + Z09- LIL: £9F¢. 180. ¥ 699- ‘HIG 4ST © 
0G0-+816- | GO0.+€86- | L00-+FL6- | G00. F886- |O10-+F196- | 100-4116. | OLO. FZ96- 82:91 G00. F066. | C00. *686- | G00. F686. |Sel9TZUeN?) 
= — a | = 
LEG. F BES. | GFZ. FILE. —| MYT FGLG. | 09S. F FEO. — | 0GS- ¥ 600- —| GFZ. F6CO.—| LAT. F2ES. | COL. FOOL. OL. F1Z9. | FLG eee 
GIS. F 86%. | BBS. FE8T-—| OGT-FILG. | 9ET- FGEFO. —| GES. FL90. | Get. F FCO. -| LET. FFT. Pee oc ered Pieacen 300 Tre Z ane a 
61-098. | 9G. FEZ. —| OPI. F8LG-. | 61Z- F990. —| FIS- FOOT. | FIZ- FGLO.—| FEL. F199- | ZOLFEEL | -FETZ 9ET- F919. | LEO. ¥ 188. “ or] 
OLI. +16€- | 66T-FZZL- —| 1ST. F FES. | L0G. F FLO. —| OGI- F6FS- | BGI FLZI- -| SOL. F289. | TIT FeL9. | Te-ZT PEL. + 129. OFO. F 68. epee. 
SGT. +88€. | LLT. FELT. —| 9II-FF09. | IST. #80. —| LOL. #962. | LAT-FZ81-—| LOL. F199. | FEL. FFIEC. 00-2 ZI. +029. | 680. ¥ 888. SUG lee! 
CII. + Geo. | O€1T-F6LF- | 960. 2E9. | ZI. FZOP. | SOL. FE9G. | TCT. F96I- | 6L0-FGOL- | LOL- +209. LT-G o90.F69!. | Lh0-F6e8. | Bra ast] © 
910. +286. | 900-826. | C00. +986. | C00. F066. | 800. L96- | 800. F016. | C00. F186. | G00. £066. FT-62 G00. ¥166- | G00. F966. |Sa1UENy 
—— = ifs —— 
OFS. FOZ. —| HES. FEF. —| FIZ. FEE. | POS. F LEP. —| E€Z- F 19Z- —| 6EZ- FFIS- —| OG. FL00- —| 90S. HaeF- | FET. FIZ. z. + BCE. ene 
GEG. + GEL. —| LES. FGI. —| LET. FOF. | 00S. FORE. --| CEZ- FELL. — | 9BS- F 60. —| GEz. F LL0. 102. 3 188. | FL. F609. ee | eer. F268. ;. une 
61G- + E90. —] HIS. FLGT.—| LLL + BFP. | PEL. FIFE. —| O€Z-FL10- | IZ. FFSI-—-| FIZ. FGI. | 961-F6zE. | 9ET- F9T9- 90-8 GLI. ¥ IGF. “ wy 
GOG- +810. | L61-+F9L-—| FSI. F68P- | CST. + 16Z-—| 261. +6SI- | S61. FECT-—| EI. FL0G- | O61-F1FZ- | PSL F129. 96-4 IST. $608. « pag| & 
GSI-EPbO. | G11. FFOS-—| SEL-FOSG. | GLI. FRI. —| OLT-F9Ge- | TeI-+OFI-—| ext. F9e. | set. +FL0. | ZI1. F029. 2L% ‘| O&l.- Fees Oita 
OOI.¥F09- | 680-099. | €OT- F062. | €60-FOFY. | B60-F9F9. | FEL EEGs. | GOL. FI9G. | TIT. F9FS. | 690. FG! €%-€ | Le0-F P48. | Bld 1 
GLO. + 8€6- | G00. F6L6- | 900. F686. | G00. F966- | O10. F196- | GOO. F6L6- | COO. ZR6- | COO. E686. | COO. LEG. 08:6 $00. F866. SeIgUENY 
— | | | 
£06-+L8r. | CES. FLPZ.—| BOL. FGGL. | LEB. FET. | 9ES- TPZ. | Che. FECT. | cel. Fogg. | 211. FLFG. | 990. F499. | 61z- EEece. ie comer @w 
PSI. +89F. | CES. +FEL-—| ELL. FIL. | LOS. F FOL. | 61Z-FGLG- | Tez. FOFT- | COL. F FPL. eee Pesan eG | Hees et ag S 
91. F609. | 618-090. —| GOI. FGOL- | FI. FO9T- | 961-F8ZE- | L1Z-FROL- | F80.F L182. | FOL. FFG. | 20.188. | GLT.F IGF. ene | tay |e 
OFT. +92G. | GOS. LEO. — | 660. F ETL | 861-F6EI- |) GLT-FL9€. | Oz. FREO. | 1L0. F908. | IGT. F FEF. | OFO.Fg68- | IGT. F¢0G. 69-7 eaeenererle: 
CEL. +F0G. | GBI. F490. —| 160-+60L- | 8L1-F FOL. | 9ST. FEse. | set. F¢90.-| ¢90.F Fog. | ect Fore. | ee. F998. | OT. EEC. FOCEW ies abuse 
£90-+6GL. | €L0.+GEL- | 680-099. | €90-FLLL- | 160-+1¢9- | €€1-FIOF- | 090-4682. | 190.4699. | LEO. +68. | LEO. FFLE. tL-€ BIG 4st | & 
G10. +6. | 900. +4986. | G00-+686- | G00- + 466. | 600: + 996. | 400. FZL6- | GOO. + 886- | 200- +686. | C00. +966. | G00. F 866: 97-82 | seuend| F 
= vie = = | | 
ne 4 a : son | ; eames) ee age te) x x 
aayjog oo0Rqoy, yeo9 SSUTARG ates | 4qsog ‘econeniaee | Onueaary surddiyg [ey senna | 


"SUOT}VIAIG paepurig 


pup SMOLM AQVQol pup SUOUZD]OLLOL) 


‘soopuy wYIT "AT WTAVA 


349 


BEATRICE M. CAveE AND Karu PEARSON 


CG-6G FEE. FOGE- | GHG. FGGT. | OGG. FOFO. | SEB FEE. | OPS. FELT.—| GES. FSP. | OLS. FOO. | LE-FZET- | OFZ. F FOS. —| 40<- + LO. ou 
LG-O€ =| 80G- + FFE. | OES. +EOT. | LEG. +L6I- | LEZ-+IOG- | FES. +60. —| FBS. FEES. | PEL FEF | G1B-FRGES- | GES. FGET- —| FBT. + BOF- ua¢ 
68-ST | I6G1-F89€- | ZIG FOBT. | OBZ. FL00- —| 11G- F G61. | 61Z- F890. —| 661 F FOE. | BLI-FRSP- | I6L- +09. | GIS. FEO. —| LOT. #60E- “ wr} Co 
» GE-8 | SLI. +6PE- | G6T-F9G6I- | GOG- FETO. —| LEL-+FE9T- | GOS. ¥8C0.—| GAT. FOE. | SL4I- FOE. | OLT-FL6e- | ZO’- +810. | 9FI- + 9ZS- “ Pag] & 
66-4 | FOL. ¥6TE. | SL1-FOST- | GOT. FIFO. | IST F680. | ZBI. FHGO.—| IVT. FSFE. | OLT. FL9B- GGL. +88€. | Z8I-+PFO. | Cel. + FOG. “ pug} 3 
00-¢ | 160. +899. | CPI. +€8Z- | FEO. +LE9- | LPL. +E9Z- | OZI-FEFF. | COL F68G- | OGI-FIGE- | E11. FZE¢- | COT. FTO. | 490-F6GL. | BIC IST 
€€-96 | T10- F996. | 610. F186. | C10. FIF6. | 120. FEI6- | 8z0-F E98. | 800. F696- | 00. F816. | 910. FLE6-. | CIO. F REG. | ZLO.F BGG. |SeyTUENy 
| 
| ae 
PGS. + 9ZE- CF: ET PL. FFL. —| FOS. ¥ LEP. —| OFS. + ESL. —| LPZ-FROL- | OGZ- FSO. | Lh. FGI. | 63S. F1LG- —| 9es. EPS. —| es. FLFS.-| “WIE 
806-+FFE-  —-HD-L EST. + PLP. —| L6L- + BOF. —| GES. FEL. —| OES. FFHO- | EEG FLATT. | LCS. FPL. | SCS. FEI. —| L2G. F961. —| BES. FPEL-—| “ yIC 
I61-+89€- | €&-F ISI. +8LF. —| S81. FO8E. —| ET. F99T- —| BIZ-F E80. | SIZ FIGL | L08-FEFZ- | 91Z- FETT-—| PIS. FLSI--| 612. F090-—-|  “ WF 5 
SLI. + 6FE. 6h Z ELT. + BLE. —| 181 FLZE. —| SBI. FOLZ- —| 66I- FIT. | 00Z- FALL. | E61. FOI. | 661. FZST-—| LET FF9T- —| BOS. FLG0.-| “ pre] 
POL. +6LE- 19-T I9T. + 9FE. —| GST. FPFO. —| POL F LTE. —| OLT- F681. | VBI FOLO. | 181 FZOL- | LLL FAT —| SLT. F FOS —| BI. FF9O-—| “pus 3 
160- + €¢9. 8: LOT. +916. | G90-499L. | PEI-FIOP. | OLL. FSGS. | SGI. F EEF. | 880.499. | O€I-FE6IF- | 680. F099. | E10. FEL: Bid 481 
T1O- + ¢¢6. 02-6T 800: +896. | 00-+F86- | L00-FEL6- | 910-F4E6- | 800-F196- | G00. FE86- | 900-F8L6- | COO. F6L6- | G00. F E86 |sarUNd 
| - 
Ch. +EE1-. | P81. FFG. OS-OZT' | LPZ- +961. | 0G3-FZG0. | OSS. +0G0-—| EFS. F ILI. | SES. FOL. | O91 FGOLG- | FIZ FEBE- | SOT. F4GL- “499 
08%. + E9T- | E81. FPLF- 08-29 T€G-+ PPI. | 9E-F0GO-. | 9ES- FLO. —| BZ. FECL. | BBS. FEPS- | OGT- FILE. | L61-+0F- | ETL F ISL. “ wag 
B1G-+98I- | [81-+8IF- 61-€€ 81Z-F8L0- | 81G-F980- | 8IZ- F980. —| 80B- FEET | LOT-FOFS- | OPI. FRLG- | LAT. SPP. | GOL FOL “ Wl 6 
C6I-+961- | E41. #8LE. 90-8T GOG- G00. | 661-FLET- | 66T- FTE. —| LET. FELZ- | GT. FEES. | LET FFES- | FET. FESPr- | 660-FELL- oe EPAETIES 
SL4I-FOGI-. | I91- F9FE. OE: OT B8I- + LG0. —| BL4T-F99T- | GLT- F608. —| 99T-FOOE- | S41. FEGT- | 911-FHO9. | ZEI-+6ZS-. | 160- + 6OL- “ pug) ~ 
CPI. +E8S-. | LET. FT. GIL Gel. +18€- | [€l-+ELF- | LG1--+690-~| 961. F6FF- | OFI-F6LZ- | G60. FZE9- | COT. FO6G. | 680. F099. Bid si 
610. + 1Z6- | 800- + 896: 66-LE 200. +066- | 600-+¢96- | TLO-+ 466. | 800-FOL6. | 100. FFL6. | G00. F986. | G00. F686. | G00. F686.  |SerzFUeNY 
OSG. + 9F0. | FOG. F LEP. —| 1PS- F9GT- 6F- GG B1G-F EGE. | GEB- FOES. | LES. FGLE. | HHS. F FST. —| OCS. F FEO. —| FOS. F LEP. —| LES. F BES: “199 
L6-+L61- | L61- + ZOv- —| 18S. + FFT. OT-2T GOG- F6LE- | 60. FEE. | GET. FIGS. | GBT. FI. —| YES. F EFO- —| 00S. FORE: —| LZ. FFEL- ye Mae 
OG: + 00. —| 8ST. F O8E —| 81Z- ¥ 8L0- 69-9 681 FELE. | OGI-FLZE- | 90B- FEFS- | OLZ. F 11Z. — | 61S- F990. —| FET. F 1FE- —| FIZ. ¥ ODT- “ wr} s 
0G. + €10- —| IST. + LZE- — | GOS. + G00. 6L-€ €8I-+908- | E81-FLOE. | S81. F99Z-. | OGT- FLFS.—| LOG. FFLO- —| CBT. F 16Z- —| Q6T- ¥ GET- “ pie} &- 
G8I-+ IPO. | Z81-F PFO. —| 2ST. FLGO. — GG. OLI-#19G- | GLI. ¥808- | OLT-¥996- | GLI- FPFI-—| I81- F980-—| GLI. F861-—| SAT FP9T- “pug | oe 
V60- + 1E9- | G90. F991. | CEI. F188. 18-F 8OI-+€9¢. | €E1- FOP | OTI-F8IG- | ZLI- FIPS. | SEI. FZOF- | £60. FOF9- | E90. FALL: BIC 1 
G10-+1P6- | G00. +486. | GOO. +066- IL-7 600: ¥€96- | L00-FZL6- | G00. F626. | G00. F886. | 400. F066. | G00. F966. | COO. F166. | SeryTUENY 
886. F ECE. OFZ. F GBI —| 0G. FZGO. | GIS. F Ege. Z6-6 096. F L0.—| 961 FG9F- | PES. F EGS. —| OG. F600. —| Ges. F 19%. —| OES. FaKe- “ U9 | w 
LGB. +10G- | GEE- FEEL. —| 9ET- FOGO. | GOS. F CLE: 82-9 PES. + 860- —| LB. + SGP. | O€- FPO. —| GE. F190. | eZ. FELI-—| 61Z- F GLZ- pee caee 
ITG.+961. | €1Z- F991. —| 8TZ- F980. | 681. F GLE. Il-F PIG. + EOL. —| CLI F19P- | 81S. FF8O-—| PIS- FOOT. | OZZ-FLIO. | 961. F 8BE- “ w]e 
461-+€9T- 881. FOLZ- —| 661. FLET- | E81. F908. 06-2 O61. F FPS. —| C91. F EPP. | GOS-FGOO. | OCT. F6FZ- | LET-F6GI- | CLT. F198. eS 
I8l-+980- | POT-+LTE.—| 8L1- F991. | OLT- + 19%. GZ: GOL. F6EE-—| GGL. FOLP- | Z8T- F090. | LOT-F6Z | OLT-FEGZ- | OCI. FEBE- “ pug} = 
LPI. +€96- | PGI-+19F- | I€1- FIP. | SOL. FE9e. 6-3 89T- +960. | 1ZI-FI8h. | €ZT-+1LF- | 8OT-+E9S | Z60.F9F9. | 160. F199. Bid ST} 5 
1G0.+€16- | 100. +€L6- | 600-¢96- | 600- + £96- 99-FT G60 + 868- | C10. IF6- | OLO-F196- | 800-F496- | O10. F196. | 600.FE96. |soyULNy | % 
| 
9FG. + EE1-— | LP. +801. | OSZ- F080. — | zs. FEE. | OGS. F LZ0. €¢- 16 P6I-FGLP. | €1Z- #88. —| 6FS- F6C0. — | 6ES- FFIS- — | ChS- FEST. 5 
VEG. +760. — 9ES-F FPO. | 9EZ. +LG0-—| 60S. F HES. | FES. F BGO. 20:21 LOI. +GOF- | ZOZ- F PLE. —| 9EB- F FSO. — | 9SS- F 60S. — | LES- + OFI- abe 
616. +890. — 81G-FE80- | 81Z- +480. —| 961. F Ese. | FIS- FZOL- €6-9 L6O1-+L1€- | 881. +188. —| PLZ + 6L0- — | Z1S- FP8T- —| L1Z- + 8OT- , UW) 
GOG. +890. — 66I-+I1&T- | 661-+1E1-—| €81-FL0E- | O61. FFP. 0&:F 961-+ZSI- | GLI. +18: 661- + LZI-—| 861. FEST. — | ZO’. + 8E0- | Bae B 
G81L-+790-— | 9LT-+681- | GLI- F60Z- —| GLI. 80%. | ZOL- F6EE. 10-€ G81. FLZO- —| QOL. FZ8S- —| LLT- FZSL- —| 181. FOFT- ZBI. +990: — “pug 
O61 FOP. | OLI. +699. | LET. ¥690-—| E€T. F107. | BST. ¥ 920. PPE Gl. FOCI. | LET. F99€- | GGT-F96I- | PET. Fess. | eel. FOP. | Bld wT 
860. +988- | 910-986. | 110. #L96- | L00-FZL6- | G20. + 868: 19-GE €10-+6F6- | 400-146. | 800-FOL6- | 900-F6L6- | L00.FZL6. | SorgTqUENd 


45 


Biometrika x 


350 Illustrations of the Variate Difference Correlation Method 


to zero, now stands at something greater than -42 and might rise higher, but 
the relation to International Commerce as a whole is zero, which suggests that 
the goods imported and exported are not in the bulk carried by rail. Further 
althoagh the final value of the Railway and Post correlation is scarcely sensible 
(— 214 + :259), it has been continuously negative from the second difference, and 
thus suggests that increased expenditure on the post means lessened profit for 
the railways. This might be interpreted in two ways: (i) that business con- 
ducted by post or telegram lessens rail intereommunication by person, or (il) that 
in the case of state-railways, there is not an increased profit to the railways from 
carrying larger mails. But still more remarkable are the negative correlations of 
Stamp Duties, Savings, Tobacco and Coffee with Railways; none of them are very 
large, and all but savings, perhaps, of the order of their probable errors. But 
taken as a whole they suggest that when the Italian spends little money in going 
about, then he saves more, or spends more on such luxuries as tobacco and coffee. 
Lastly we have the Coal Index. It might be supposed that a year with great 
coal importation would signify great railway activity, and this-is the judgment 
which would be made from the raw correlations of these variates. But the actual 
facts are exhibited in a correlation still falling at the sixth difference and hardly 
significant having regard to its probable error. The inferences formed must be: 
(i) that imported coal is used largely at the ports of disembarcation or travels 
inland by other than railway transit, (11) that the imported coal is largely used on 
the railways themselves and that its cost is a heavy tax on their resources. 

(c) Shipping Index. As we might anticipate this is highly correlated with 
(i) Railways (c. 62), (ii) Revenue (c. °75) and less highly but very significantly with 
(111) International Commerce (c. 54) and (iv) Coal (c. 58), but it appears to have 
no relation whatever with Post, Stamp-Duties and Savings, and when we come to 
luxuries, their importation is clearly not a factor of shipping prosperity. Neither 
in the case of J’obacco nor of Coffee are the correlations really significant ; with 
the former we have an increasing negative correlation and with the latter a 
decreasing positive one already below its probable error. Thus we see that 
neither directly by bulk of importation nor indirectly by immediate increase of 
consumption, does a rise of shipping mark significant rises in the use of luxuries 
such as Tobacco and Coffee. It would be of interest to ascertain whether in- 
creased consumption of luxuries does not rather follow than accompany favourable 
trade fluctuations. 

(d) Revenue Index. This index as we might expect is fairly highly cor- 
related with Shipping (c. °75). It has relatively small relation to Railways 
(422 +206) at least at the sixth difference and a somewhat similar value 
(c, 42 4°20) for Coffee. Thus the suggestions arise that revenue is but little 
produced by the railways and that coffee is not a very large factor of the custom 
dues. It is astonishing to find, however, that Post, Stamps and Savings have 
negative correlations with Revenue of —°888 +°213, — 255 + 234 and —:154 4+ :244 
respectively, which, if scarcely significant, have been in each case for several 


Bearrick M. Cave and Karu PEARSON Bay 


differences persistent in sign. Even the correlation with Tobacco is small, falling 
and insignificant (< 115 +247), and that with Coal which might be supposed to 
be high as marking good trade times is hardly significant although apparently 
rising (? > -270 + 232). Lastly the correlation of Revenue with International 
Commerce is again small, falling, and insignificant (< ‘214 +°259). Thus Revenue 
or the “entrato effectivo dello stato” seems to provide an index which has little 
valuable relation to any other characteristic of prosperity beyond shipping. 


(e) International Commerce. Here we find no single final individual index 
correlation greater than °54, which is that for Shipping. The next most important 
correlations are with Post (>°47) and Stamp Duty (c. 46). With Railways the 
correlation is zero, and with Revenue also falling and insignificant. With Savings, 
Coal, Tobacco and Coffee the correlations are all insignificant; in fact in the last 
three cases not only are the values less than their probable errors, but they are 
still falling. It is thus clear that in Italy the total of Exports and Imports is 
no measure of all-round prosperity, they do not immediately increase either 
savings or the consumption of luxuries. 


(f) Post and Telegrams. Here we have the lowest series of correlations we 
have yet reached, Post values have no significant relation to fluctuations in 
Railway (c. —:20 + °24), to Shipping (— 059 + 249), Stamp Duties (—027 + °250), 
Coal (— 050 + 250), Tobacco (+ 108 + 247) or Coffee (— 183 + :246) Indices. It 
is significantly correlated only with International Commerce (> + ‘47 +'19) and, 
perhaps, significantly with Savings (+ 336 +°222) but negatively with Revenue 
(c. —'38 +:21). In short the number of letters and telegrams in Italy is hardly a 
mark of any other favourable fluctuation in prosperity, beyond International 
Commerce. 


(g) Stamp Duties. This Index is correlated positively and significantly with 
International Commerce (c. +'46 +°20) and positively, and doubtfully with Savings 
(c. + °85 +°22). It is correlated insignificantly and negatively with Raalways 
(—'261 +233), Shepping (— 009 + -250), Revenue (—'255 +°234), Post (—'027 +250), 
and Tobacco (—'129 + 246); it is correlated positively and insignificantly with 
Coal (+ (052 + ‘250) and Coffee (+ °222 +238). Thus again freed from continuous 
time changes, fluctuations in the Stamp Duty Indew are of small value as a 
measure of contemporaneous general prosperity. 


(h) Savings Bank Index, There are practically only two correlations of any 
importance with Savings and these are both negative, namely those with Railways 
(— "431 + 204) and with Yobacco (—°431+4°204). Hence it would appear that 
when the Italian people is in a saving mood, it spares on transit by rail and on 
the consumption of tobacco, and when it expends on these luxuries, then it does 
not save. Savings have small and possibly not significant correlations with Post 
(> +°33 4°22) and Stamp Duties (< +'353 + °219), and insignificant and positive 
correlations with International Commerce (>+°27 + °23), Coal (> +19 + °24) 


45—2 


352 Illustrations of the Variate Difference Correlation Method 


and Coffee (c.+°05 +°25); they have insignificant negative correlations with 
Shipping (< — 03 4°25) and Revenue (< —‘15 + °24). 


Savings are thus—apart from continual time change—no very satisfactory 
measure of general prosperity, and a fluctuating increase is usually accompanied 
by a reduction of luxuries. 


(t) Coal Index. The importation of coal has little relation to any factor of 
prosperity besides Shipping (c.+°58 4°17). With Railways the correlation is 
not quite double the probable error and the value, even at the sixth difference, 
appears still falling. The correlation with Revenue only just exceeds the probable 
error (+ '270 +232). With International Commerce (+171 + °243), Stamp Duties 
(+ 052 +°250), Savings (+ 196 +241) and Coffee (+ 152 + °245) the correlations 
are less than their probable errors, small and in some cases still falling. With 
the Postal Index, the correlation is negative, insignificant and falling. Alone in 
the case of the Tobacco Index does the correlation appear to be nearly as 
significant as in that of Shipping, but it is negative and increasing* (—°514 + 184), 
while in the case of Shipping it was steady. It is singular to find that Coal, the 
increased import of which should mark increased industrial activity, is, beyond 
the naturally influenced Shipping, alone effectively associated with the con- 
sumption of Tobacco. 


(j) Tobacco Index. This is of considerable interest as marking the association 
of indices of trade prosperity with the consumption of a luxury. With four 
exceptions J'obacco is negatively correlated, although often insignificantly, with 
the other indices. Revenue (+°115 + °247), International Commerce (+ °015 + ‘250), 
and Post (+ °108 +:247) are all positive, insignificant, and in the first two cases 
still falling. The correlation with Coffee is positive and might, perhaps, be 
significant (+°326 +°224), but it appears to be still falling. With Coal and 
Savings there are probably significant negative correlations (—°514+°184, and 
— 431 +°204 respectively); with Railways (— ‘243 + ‘236), Shipping (— ‘271 + °229) 
and Stamp Duties (— ‘129 + ‘246) there are insignificant negative correlations, but 
they tend to confirm each other in sign. Thus we see that the consumption of 
tobacco can hardly be considered as a measure of general prosperity; it appears 
to be greatest when trade conditions are unfavourable, and in particular when 
savings are least and manufacturing conditions as measured by the importation of 
coal are slack. The result suggests the pipe of the unemployed at the street 
corner, rather than the increased expenditure of the fully occupied artisan. 


(k) Coffee Index. 'This is another luxury and the results are very similar. 
There appears a significant correlation with Revenue (+ °400 + 210), which might 
easily be explained, and there is a falling but possibly significant correlation 
with Tobacco (+326 +°'224), With all other indices the relationships are 


* It is, perhaps, hard to believe that so much smuggling could be carried on in colliers, that it would 
seriously affect the profits of the tobacco monopoly ! 


—_—_  — 


BeaTRicE M. Cavk anp Kart PEARSON 353 


insignificant. Railways (—‘204 +240), Shipping (+ °2382 +°237), International 
Commerce (+142 +245), Post (—'133 4246), Stamp Duties (+°222 + 238), 
Savings (+°046 +250) and Coal (+°152+°245). Apart therefore from the 
general increase of consumption with the time, during which time the general 
prosperity of the nation has increased, it would not appear that the consumption 
of a luxury has any organic relationship to prosperity. We do not find that a 
favourable trade fluctuation is associated with increased consumption of luxuries. 
In fact the suggestion arises that in the case of tobacco the consumption may be 
greater in a period of depression. 


Conclusions. While we lay no special stress on any of the results suggested 
by the difference correlations above studied—far more intimate economic know- 
ledge of Italian affairs and methods of measurement would be requisite—we yet 
venture to insist on one or two general considerations. 


The very superficial statements, so frequently met with, that such and such 
variates, both changing rapidly with the time, are essentially causative will 
doubtless cease to have any scientific currency, directly the method of variate 
differences is fully appreciated. We shall no longer assert that the fall of 
the phthisis death-rate can be off-hand causatively associated with the con- 
temporaneous rise in the number of persons dying in institutions, or that the 
increased expenditure on luxuries is necessarily a measure of increased national 
prosperity. 


If we turn as in the present paper to the actual correlations of the indices 
themselves, we find in every case an arid and scarcely undulating waste of high 
correlation. No one can obtain any nourishment whatever from the statement 
that the Tobacco Indez is correlated with the Revenue Index to the amount of :983 
and with the Suwings Bank Indeaw to the extent of ‘984! The organic relationship 
between these variates is wholly obscured by the continuous increase of all three of 
them with the time. But when we proceed to sixth differences and see that the 
consumption of tobacco has little, if any, relation to revenue, and is associated 
substantially but negatively with savings, we seem to touch realities, and realities 
of some worth. Again what can we learn, if we are told that the Shipping Indes is 
correlated to the extent of ‘99 with both the Revenue and the Savings Bank Indices? 
We might imagine, that increase of shipping was not only the primary cause of 
increase in Italian revenue, but also the essential origin of any increase in the Italian 
peasant’s and artisan’s savings! An appeal to the variate ditference method shows 
how fallacious such imaginings would be! An examination of the sixth difference 
correlations shows that while prosperity of the revenue is closely associated with 
trade as measured by shipping (77), the correlation is not nearly perfect; on 
the other hand there appears to be no significant organic correlation at all 
(— 154 + ‘244) between the prosperity of the revenue and the savings of the 
Italian populace. As we have noted a knowledge of local conditions and methods 


354 Illustrations of the Variate Difference Correlation Method 


of reckoning quantities might enable us to put other and, perhaps, more luminous 
interpretations on our results. But there can be small doubt that to proceed from 
the actual correlation of such indices to the correlations of their higher differences 
gives the feeling of clearing away the sand of the desert, and reaching all the 
ordered arrangements of an excavated town below; the slight undulations of the 
waste above are really fallacious, and enable us to appreciate nothing of the 
actual topography of the city. 


The method is at present in its infancy, but it gives hope of greater results 
than almost any recent development of statistics, for there has been no source 
more fruitful of fallacious statistical argument than the common influence of the 
time factor. One sees at once how the method may be applied to growth problems 
in man and in lower forms of life with a view to measuring common extraneous 
influences, to a whole variety of economic and medical problems obscured by the 
influences of the national growth factor, and to a great range of questions in 
social affairs where contemporaneous change of the community in innumerable 
factors has been interpreted as a causative nexus, or society assumed to be at 
least an organic whole; the flowers in a meadow would undoubtedly exhibit 
highly correlated development, but it is not a measure of mutual causation, and 
the development of various social factors has to be freed from the time effect, 
before we can really appreciate their organic relationships. 


In the present paper we have dealt only with very sparse “ populations” (only 
28 values of the variates), but this has enabled us to consider not only a very 
large number of correlations, but to see the practical influence of terminal con- 
ditions on our theory. This may we think be summed up in the statement that 
the Andersonian formulae for the standard deviations will hardly in many practical 
cases be more than very roughly approximated before the size of the population 
becomes too small to make the deductions reliable. Further in most cases our 
difference correlations have hardly even with the sixth differences reached a steady 
state. Possibly they have done so in the cases of Rail and Shipping, Shipping 
and Post, Shipping and Coal, Revenue and Post, International Commerce and 
Stamp Duties, International Commerce and Savings, Savings and Coffee, and in 
one or other additional pair. But in the great bulk of instances there is still a 
more or less steady rising or falling appreciable in the difference correlations, and 
all we can really say is that the final value, the true ryy, will be somewhat 
greater or less than a given number. From an examination of the actual 
numerical working of the correlations, it appears to us that the terminal values 
are in the case of these short series of very great importance. It is further clear 
that the theory as given by “Student” depends upon certain equalities which 
are not fulfilled in practice in short series. We await with much interest the 
complete publication of Dr Anderson’s work, and hope to find a fuller discussion of 
the allowance to be made in short series for the influence of the terminal state of 


Beatrick M. Cave anp Karu PEARSON 355 


affairs* on the steadiness of the series and on the approach to the standard- 
deviation formulae. But apart from these lesser points, our present numerical 
investigation has convinced us of the very great value of the new method of 
Variate Difference Correlations. 


* For example if we measure X from its mean, 


J] “1 = afi n n m 
oa pang 8 (Xe Kou)? (AR P= 5 (8 (9) - Ky 8 (A) MP) - AN 
1 1 


~n—-1 


n-1 : r, 2 , V"\2 
since S (X,X,,;) is by hypothesis zero, Sg ee Ng teary, 1 {o2x,— 4 (X?+X,?)} —(4X)?. The first term 
1 2 or : 


: aoe Hi n—1 = = 1 r r 
207y. gives Dr Anderson’s value of o?,y. Now AX equals nisi 8 (X, — X544) = (X,-X,). Thus the 


v—1 


ee. 2 ail 
remainder is ——> [ ox,-3 oe? a ala 


i (X,- x, | . Now the average value on many trials of 


4 (X72+X,2) will be oy, and of (X;-X,)*, 20, so that the full value may be 2 77x and small 


2 
(nv —1) 
for n large; but for n small as above such a relation as o®, y=20%, and the similar but more complex 
relations of the standard deviation formulae for the higher differences need not hold for any individual 
case, and thus the steadiness of the difference correlation series, and the approach to the Andersonian 
formulae are very far from attained. 


AN EXAMINATION OF SOME RECENT STUDIES OF THE 
INHERITANCE FACTOR IN INSANITY 


By DAVID HERON, D.Sc.* 


In the last few years a number of studies of the inheritance factor in insanity 
have been published in America, Germany and England. The value of investigation 
of such a topic cannot be overestimated. We are quite certain that the prevalence 
of insanity is not falling; many of us indeed believe that the statistics suffice to 
demonstrate that it is substantially increasing, and that we can attribute this 
increase not in the first place to the intenser strain of modern life, but to the 
greater power of modern treatment to check or temporarily cure attack, and thus 
allow wider possibility of reproduction to members of affected stocks. Indeed the 
problem seems closely associated with an essential difficulty of modern civilisation, 
the greater protection of physically and mentally degenerate stocks unaccompanied 
by any adequate limitation of their thereby increased power of procreation ; the 
inheritance factor thus tends to aid the relatively greater survival of the socially 
unfit. The studies we have referred to would be of great importance from this 
aspect of eugenics if (i) the data were collected without conscious or unconscious 
bias, and (ii) the inferences drawn from them followed logically from the data thus 
collected. 


Unfortunately it is not only in the interpretation of statistics that adequate 
training is required. It is equally important that in the actual collection of them 
we should proceed, not only free from the bias which arises from the hurried 
acceptance of dogmatic theories of heredity, but what is often still more needful, 
free from the bias which is almost certain to waylay our progress, if we have not 
initially considered with trained insight the fallacies which may result from our 
method of recording or even tabulating our material. The day of the amateur in 
svience is gone; no one now pays any attention to men who propound elaborate 
atomic theories or stellar hypotheses, without having had preliminary training in 
physical or astronomical science. There are still, however, some who appear 
willing to accept the statement of statistical data or the inferences drawn from those 


* This paper formed the second portion of a lecture given at the Galton Laboratory on March 3, 1914. 


D. HERon 357 


data by men who have clearly had no adequate training in statistical science. The 
craniologist, the anthropologist, even the biological student of heredity and evolu- 
tion are recognising that a statistical training is needful for the true interpretation 
of many of the facts in their special fields of research. The physiologist still 
appears to believe that he can deal with the average effects of diverse dietaries or 
the pathologist with the “ mass-phenomena” of the hereditary factor in insanity 
without any training in statistical method. A physicist might just as logically 
assume that without mathematical training he could give an adequate mathe- 
matical account of a physical phenomenon, or a cosmic theorist suppose that he 
was effectively furnished for astronomical research by the perusal of a popular 
primer on the stars! The statistical calculus cannot be mastered by any easier 
road than the differential calculus, or, to put a more apt illustration, statistical 
training is as needful a preliminary to the handling of statistics, as time spent 
in a physiological laboratory to the effective handling of tissues. In twenty years 
it will be unnecessary to insist on these points, they will be universally recognised 
in the courts of science; but at present it is not only necessary to reiterate 
unpleasant truths, but to emphasise their validity by illustrations which bring 
home forcibly to scientist and layman alike the danger of amateur statistical 
handling. To state that a man is in error is not sufficient, if he continues time 
after time to repeat his assertions, apparently under the belief that incessant 
repetition will convince the world of the value of his theories. 


In the case of the inheritance factor in insanity we are not dealing with any 
purely academic question of science. We are up against one of the most difficult 
problems of modern life, where true advice is of urgent importance to the nation 
as well as to the individual. It is not only the medical man but the layman who 
seeks guidance in the question of the marriage of members of insane stocks, and a 
laboratory like the Galton Laboratory knows how often advice on such points is 
sought. It is disheartening when help is rendered to the seeker to be faced with 
the criticism: “ But Professor says I may marry if I take a wife of sound 
stock,” or “Dr recommends marriage, although my father was insane, because 
I am over twenty-five and still sane myself.” When teaching of this kind, arising 
solely from false interpretation of defective data, is spread widecast in a dozen 
different papers or journals, it is not sufficient to issue a brief statement of its 
futility. It is needful to give it the coup de grdce by a more lengthy criticism of 
its fallacies and their illustration in a form more likely to impress the imagination. 
The attempt is made in this paper to deal with only one of the authors, who have 
contributed fallacious eugenic rules to those seeking knowledge on the influence of 
the hereditary factor in insanity. 


In a long series of papers Dr F. W. Mott, Pathologist to the London County 
Asylums, has stated that when the children of insane parents become insane, they 
do so at a much earlier age than did their parents, and on the basis of this assertion 
he has drawn some very sweeping conclusions for practical conduct. Thus in the 

Biometrika x 46 


358 Recent Studies of the Inheritance Factor in Insanity 


British Medical Journal of May 11, 1912 (p. 1060), he states that “this signal 
tendency of insane offspring to suffer with a more intense form of the disease and 
at an early age, as shown in the above figures and tables, is of great importance 
for the following reasons: first, it is one of Nature’s methods of ending or mending 
a degenerate stock; secondly, it is of importance to the physician, for he can say 
that there is a diminishing risk of the child of an insane parent becoming insane 
after he has passed 25, a matter of great importance in the question of marriage ; 
thirdly, it is of importance in connection with the subject of social surgery of the 
insane, for when the first attack of insanity occurs in the parent the children for 
the most part have all been born....Sterilization would therefore be applicable to 
relatively few parents admitted to asylums.” 


Put briefly, Dr Mott’s views are that in “Antedating” or “ Anticipation,” in 
this alleged tendency of the offspring to become insane at any earlier age than 
their parents, we have Nature’s method of purifying degenerate stocks, that the 
children of insane parents who are still normal at the age of 25 may safely marry*, 
and that it is useless to take any special measures to limit the reproduction of the 
insane since nearly all their children are born before the onset of insanity. 


These conclusions, if proved to be correct, would be of the utmost importance 
to the Eugenist. If the Law of Antedating or Anticipation really acts in the 
way Dr Mott has suggested, then it would seem to be unnecessary to take any 
special Kugenic action in the case of the insane and indeed the “ Law” has already 
been used in support of this view. Thus in a leading article in the British Medical 
Journal+, which deals with Dr Mott’s work, it is stated that “This intensification 
of mental disease in the young—this ‘ anticipation’ as it is called, which is one of 
Nature’s methods of ending or mending a degenerate stock, is specially important 
in connection with sterilization, as the figures given by Dr Mott show that when 
the first attack of insanity occurs in the parent the children have for the most part 
all been born. Sterilization, therefore, would be applicable in relatively few 
cases. 


It is at least obvious that when views such as these are taken of the “Law of 
Anticipation,” it merits the most careful examination. Let us consider, then, first 
of all, Dr Mott’s presentation of the case for anticipation. For some years past 
Dr Mott has been engaged in the collection of cases in which two or more members 
of a family are or have been resident in London County Asylums, and has noted 
wherever possible the age of onset of the insanity. Information was thus obtained 
regarding 217 pairs of father and offspring, and 291 pairs of mother and offspring 
and the results are summed up in the following table. 

Thus in comparing the age at onset of insanity in father and offspring, we find 
that among the fathers only 1:4°/, became insane before the age of 20, while among 
the offspring the percentage was 26°2. These figures are also shown graphically in 


* See for instance Problems in Eugenics, p. 426. 
+ May 11, 1912, p. 1089. 


D. Herron 359 
Figs. 1 and 2*. Here the horizontal scale represents the age of onset in 5-year 
groups—the vertical scale the percentages of cases occurring in each age group. 
TABLE I. 


Percentages of Cases whose First Attack of Insanity occurred 
within Various Age-periods. 


Age-periods Father | Offspring | Mother Offspring - 
Percent.|} Per cent. | Per cent. Per cent, 

Under 20 years ig 26°2 0°6 27°8 

20—24 years... 0-4 18-0 374 15°7 

pting 1-4 | 18-0 44 18-2 \ Adolescence 
30—84 ,, 9°6 13:0 4s) 13°4 

35—39 ,, 11°5 73 9-2 10°0 

40O—-AL,, 9-2 6-4 10°3 58 

4I—49,, 14:3 6:0 12°0 3'7 | Involutional 
50—54,, 17°5 0-9 12°3 2°4 t period 
55—59 13°8 3°7 14:0 Py) 

60—64 ,, 10°1 — 11°6 1°3 

65-—69 _,, 5:0 _- 8°8 — 

70—74 ,, 46 0-4 31 — 

75—79 ,, 0-4 _- 1:3 = 

80 - 0:4 — 0°6 — 


I have been obliged to follow Dr Mott in treating the “under 20” group as a 
5-year group as otherwise my diagrams would bear no resemblance to his, but this 
procedure is far from satisfactory when such a large proportion of the cases in this 
group are congenital cases in which the age of onset should be taken at 0 years. 
The tables and diagrams show that among the parents more than half the cases 
occur after the age of 50, while among the offspring, more than half occur before 
30, and this is taken to prove that there is Anticipation or Antedating in 
Insanity. 


This will perhaps be made more evident if the percentages of those who became 
insane before the age of 25 are given in each case. Among the fathers, 2°/, and 
among the mothers, 4°/, became insane before the age of 25. Among the off- 
spring, on the other hand, the percentage is 44. Another way of looking at the 
matter is to take the average age of onset of insanity in each case. Dr Mott 
gives a Table showing these averages but unfortunately has omitted the congenital 
cases so that the extent of anticipation is considerably under-estimated, and the 
form in which the data are given does not permit of an accurate calculation of the 
actual averages. From the information given it appears, however, that the average 
age at onset of insanity among the parents is about 50 years, among the offspring 
about 26 years, showing an anticipation or antedating of some 24 years. 

* I am very grateful to Miss H. Gertrude Jones, the Hon. Secretary of the Galton Laboratory, for 


the diagrams which illustrate this lecture. 
46—2 


360 Recent Studies of the Inheritance Factor in Insanity 


et 


at Ons 


1 and 2. Diagrams to illustrate the Distribution of Age 


+ and Offspring. (Mott.) 


ty in Paren 


of Ingani 


Fathers [_] 
Offspring FY 


= 
[i 


D. HERON 361 


Now these conclusions, if satisfactorily demonstrated, would obviously be of 
the highest importance, but they were immediately challenged by Professor Karl 
Pearson in a letter which appeared in Nature of November 21, 1912 (p. 334). 
Professor Pearson’s letter is as follows: 


On an Apparent Fallacy in the Statistical Treatment of “ Antedating” in 
the Inheritance of Pathological Conditions. 


The problem of the antedating of family diseases is one of very great interest, and is likely 
to be more studied in the near future than ever it has been in the past. The idea of antedating, 
i.e. the appearance of an hereditary disease at an earlier age in the offspring than in the parent, 
has been referred to by Darwin and has no doubt been considered by others before him. Quite 
recently, studying the subject on insanity, Dr F. W. Mott speaks of antedating or anticipation 
as “Nature’s method of eliminating unsound elements in a stock” (“Problems in Eugenics,” 
papers communicated to the First International Eugenics Congress, 1912, p. 426). 


Iam unable to follow Dr Mott’s proof of the case for antedating in insanity. It appears to 
me to depend upon a statistical fallacy, but this apparent fallacy may not be real, and I should 
like more light on the matter. This is peculiarly desirable, because I understand further 
evidence in favour of antedating is soon forthcoming for other diseases, and will follow much the 
same lines of reasoning. Let us consider the whole of one generation of affected persons at any 
time in the community, and let , represent the number who develop the disease at age s, 
then the generation is represented by 


Noy Nyy Ng_ vos Ngys0- N00, Say. 


Possibly some of these groups will not appear at all, but that is of little importance for our 
present purpose. 


Let us make the assumptions (1) that there is no antedating at all; (2) that there is no 
inheritance of age of onset; thus each individual reproduces the population of the affected 
reduced in the ratio of p to 1. Then the family of any affected person, whatever the age at 
which he developed the disease, would represent on the average the distribution 


Po, PN, Pla, +++ PNs5 +++ PN100- 
The sum of such families would give precisely the age distribution at onset of the preceding 
generation. 


Now let us suppose that for any reason certain of the groups of the first generation do not 
produce offspring at all, or only in reduced numbers. Say that g, only of the n, are able 
to reproduce their kind; then of the older generation, limited to parents, the distribution 
will be 

Jo ot M121 + GoN2t «e+ YgNgH «+» + Y100%1005 
but the younger generation will be 
D (GoM + G12 + YoNgt 0. + Ye Nat --- + G100%100) (pM +... HNg+... +2100); 
i.e. the relative proportions will remain absolutely the same. 


The average age at onset and the frequency distribution of the older generation, that of the 
parents, will be entirely different from that of the offspring and will depend wholly on what values 
we give to the g’s. If frequency curves be formed of the two generations they will differ 
substantially from each other. This difference is not a result or a demonstration of any 
physiological principle of antedating but is solely due to the fact that those who develop the 
disease at different ages are not equally likely to marry and become parents. 


362 Recent Studies of the Inheritance Factor in Insanity 


A quite striking instance of the fallacy, if it be such, would be to consider the antedating of 
“violent deaths.” Fully a quarter of such deaths in males, nearly a half in females, occur before 
the age of twenty years. Consider now the parents and offspring who die from violent deaths ; 
clearly there would be no representative of death from violence under twenty in the parent 
generation, and we should have a most marked case of antedating, because the offspring 
generation would contain all the infant deaths from violence. 


In the case of insanity, is the man or woman who develops insanity at an early age as likely 
to become a parent as one who develops it at a later age? I think there is no doubt as to the 
answer to be given ; those who become insane before twenty-five, even if they recover, are far 
less likely to become parents than those who become insane at late ages—many, indeed, of them 
considering the high death-rate of the insane, will die before they could become parents of large 
families. Now Dr Mott took 508 pairs of parents and offspring, “collected from the records of 
464 insane parents whose 500 insane offspring had also been resident in the County Council 
Asylums,” and ascertained the age of first attack. As at present advised, it seems to me that 
his data must indicate a most marked antedating of disease in the offspring, but an antedating 
which is wholly spurious. There is, I think, a further grievous fallacy involved in this method 
of considering the problem, but before discussing that I should like to see if my criticism of this 


method of approaching the problem of antedating can be met. 
KARL PEARSON. 


Biomerric LABORATORY, 
University Cotieer, Lonpon, 
November 11, 1912. 


Dr Mott has referred to this letter in his Report for 1912*, but it will be more 
convenient to deal with his reply after we have examined the method by which his 
data have been collected and the use made of the data. Let us consider first of alli 
how the data were obtained. Dr Mott in describing his material says that it 
consists of a collection of cases in the London County Asylums where two or more 
persons are related to one another. Thus Dr Mott has dealt—not with a series of 
complete pedigrees in which every member is included, whether insane or normal, 
but with a series of cases in which two or more members of a family are known to 
have been in London County Asylums. No notice is taken of those who are 
normal throughout their lives and no allowance is made for those who are normal 
at the time the record is made but who may afterwards become insane. 


Do cases selected in this way provide a complete or impartial view of the facts? 
Some of Dr Mott’s own comments on his data throw a considerable amount of 
light on this point. In his Report for 1909+ he says: “From all the Asylums 
I have received valuable reports, but in the case of the older asylums it has been a 
matter of the utmost difficulty to trace the records of so many years back,” and in 
his Report for 1910+ he says, ‘Some of the asylum authorities have gone through 
their case books for a number of years back, but the results have not been 
satisfactory owing to the difficulty of obtaining particulars without a living repre- 
sentative of the family being resident in the asylum—for instance, 110 old cases 

* Annual Report of the London County Council for 1912, Vol. 11. p. 62. 


+ Twentieth Annual Report of the Asylums Committee of the L.C.C., p. 90. 
+ Twenty-first Report of the Asylums Committee of the L.C.C., p. 94. 


D. HERon 363 


reported from Bexley have been rejected as the relatives in the other London 
County Asylums could not be traced, for no instance has been included unless full 
particulars could be obtained.” 


It is thus clear that not all the cases could be traced and that there was 
special difficulty in tracing the older cases. What is the effect of a selection 
of this kind? A study of the following hypothetical cases may serve to throw 
some light on this point. 

TABLE ILI. 


Anticipation or Antedating in Insanity. Hypothetical Laamples to 
show the Effect of Dr Mott's Selection of Cases. 


| 
First Example | Second Example 
Mother: Born Sod ss we oi te. | 1873 1833 
Married... a ey aay oe 1893 1853 
Became Insane and admitted to Asylum 1913 1873 
Age at First Attack seis ise Pot 40 40 
Died fee oe if ave eee 1914 1874 
Son: Born... ace 900 See Scie oc 1894 1854 
Became Insane ... “ine ies nor 1894* 1914 
Admitted to Asylum... apo oe 1914 1914 
Age at First Attack ... a ne 0 60 


The mothers in those two examples have exactly parallel careers. In each 
case the mother became insane at the age of 40 and only lived one year in the 
asylum. In the first case the son was a congenital idiot but was only admitted to 
an asylum at the age of 20. The age of onset in this case is taken at 0 years and 
the case shows marked “ anticipation.” In the second case the mother also became 
insane at the age of 40, the son not till the age of 60, 40 years after his mother’s 
death. The second example thus tells against the Law of Anticipation. Are 
these two cases equally likely to appear in Dr Mott’s data ? 


In the first case mother and son are in the asylum at the same time and were 
admitted within a year of each other. It is very improbable that the relationship 
would escape notice and such a case is almost certain to be recorded. In the 
second case, however, the son is not admitted to an asylum till 40 years after his 
mother’s death. Even if the family remained in the same area for 40 years after 
the mother’s death, it would obviously be very difficult to connect the histories of 
mother and son. This case, which tells against the Law of Anticipation, is 
almost certain to escape notice. A spurious anticipation or antedating is thus 
inevitable owing to the method of collecting the data. 


It has also been pointed out that Dr Mott has made no allowance for those 
who are mentally normal at the time the record is made but may subsequently 


* Congenital Idiot. 


364 Recent Studies of the Inheritance Factor in Insanity 


become insane, and this introduces further spurious anticipation. Another hypo- 
thetical example will perhaps make this clear. Let us take the case of a mother 
with six children, five of whom have become insane as follows: 


TABLE IIL. 
Children 
Mother eis bal 5 | 

ipl | 2 eat biel wee) 5 6 
Born 506 dot one 1830 1850 | 1852 1854 | 1856 1858 | 1860 
Became Insane ... | 1860 — 1872 1896 1914 1888 1860+ 

| Age at Onset of Insanity | 30 —* 20 | 42 | 58 30 0 
| a 


The extent to which this family would show anticipation or antedating 
would depend very largely on the time at which the record was made as is shown 
in the following table. 


TABLE IV. 

Age of Onset of Insanity in 
Date of Average for | Amount of 
Record Children Anticipation 

Mother Children 
1860 30 0) 0) 30 
1872 30 0, 20 10 20 
1888 30 0, 20, 30 16°7 13°'3 
1896 30 0, 20, 30, 42 23 0 7 
1914 30 0, 20, 30, 42, 58 30 (0) 


If the case were noted in 1860 then the age of onset of insanity in the mother 
is 30 years—of the child 0 years—a clear case of anticipation, and nothing 
would be known of the fact that four other children will afterwards become insane 
and will bring the average age of onset in the children up to 30 years—exactly 
the same as that of the mother. Nor is the record even now complete for if the 
eldest child ever becomes insane, the age of onset in his case must be at least 
64 years and this will further increase the average age of onset in the children. 
It is thus clear that in dealing with incomplete families and ignoring the possi- 
bility that those who are normal at the time of record may afterwards become 
insane, Dr Mott has introduced a further spurious anticipation or antedating. 


If we examine carefully the first pedigree given by Dr Mott at the Eugenics 
Congress}, we see clearly how probably much of the anticipation recorded by 


* Alive, 64 years of age and still normal. 
+ Congenital Idiot. 
+ Problems in Eugenics, p. 413. 


D. Heron 365 


Dr Mott has arisen. Unfortunately this is the only pedigree for which sufficient 
details have been given to enable its completeness to be tested. The pedigree 
and Dr Mott’s description of it are as follows : 


“A.B., an alien Jew, aged 54, was admitted to an asylum for the first time suffering from 
involutional melancholia ; he has a sister who has not been in an asylum, but, as events turned 
out, bore the latent seeds of insanity. The man is married to a healthy woman who bore him a 
large family ; the first five are quite healthy, then comes a congenital imbecile epileptic (cong.)*, 
then two healthy children followed by a daughter who becomes insane at 23, then a son insane 
at 22, and lastly two children who are up to the present free from any taint. The sister of A.B. 
is married and has a family of ten, seven girls and three boys; one of the females was admitted 
to the asylum at the age of 19, and since this pedigree was constructed a brother of hers has 
been admitted aged 24. Half-black+ circles are insane. The pedigree is instructive ; it shows 
direct and collateral heredity ; it also shows remarkably well the signal tendency to the 
occurrence of insanity at an early age in the children of an insane and potentially insane 
parent.” 


3 Brothers 
6 Sisters: 


23 22 19 
@ = Insane. 
13 children: 9 Alive, 4 Sons, 5 Daughters. 4 Dead. 3 Insane. 


1 
Cong. 


Fig. 8. Pedigree to illustrate the effect of Dr Mott’s selection of cases. 
F. W. Mott: ‘‘Heredity and Eugenics in relation to Insanity.” Problems in Eugenics, p. 413. 


This pedigree was given as above in July 1912, and in an address previously 
delivered before the Manchester Medical Society on Oct. 4, 1911, Dr Mott gave 
the same pedigree, but without any reference to the nephew of A.B. (brother of 
the girl who became insane at 19) who became insane “since the pedigree was 
constructed,” so that this man became insane between 1911 and 1912 and this 
serves to “date” the pedigree. 


Now it should be noted that at least five of the children of A.B. are over 23 years 
of age and up to the present time healthy. But all these children are alive and if 
any one of them afterwards becomes insane, the average age of onset of insanity in 
the children will be raised—and it is clear that the more incomplete the pedigree 
the greater the amount of spurious anticipation. Again Dr Mott states that in 

* This does not agree with Dr Mott’s pedigree which gives the congenital case as the seventh 


instead of the sixth child. 
+ According to our usual custom, they are represented by full black circles in Fig. 3. 


Biometrika x 47 


366 Recent Studies of the Inheritance Factor in Insanity 


nephews and nieces the age of onset is earlier than in uncles and aunts. In 1911 
this pedigree gives a case in which an uncle became insane at 54, his niece at 19— 
but one year later a nephew who became insane at 24 has to be added, thus 
raising the average and there are eight more children some at least of whom 
may become insane at later ages. As before the incompleteness of the pedigree 
introduces an artificial and spurious anticipation or antedating. The remedy is 
obvious ; we must only deal with completed families. 


A further fallacy involved in Dr Mott’s method of work must now be noted. In 
directly comparing the age of onset in parent and child, Dr Mott has ignored the 
fact that in the parent the incidence of insanity is for all practical purposes 
limited to the age of 20 and over since cases of congenital defect and of adolescent 
insanity hardly ever marry. Among the general population of asylums, however, 
12°/, become insane before the age of 20 and in Dr Mott’s selected data the 
percentage rises to 27—or more than a quarter of the whole become insane before 
20. This in itself causes a very marked spurious anticipation. As Professor 
Pearson has shown (p. 361 above) if we were to investigate the age at death in 
parent and child from accident or violence, we should find the same spurious 
anticipation. 


There are thus three fallacies involved in Dr Mott’s work. In the first place a 
spurious anticipation or antedating arises from the inclusion in the record of 
families whose history has not yet been completed, for those who become insane 
at late ages in the younger generation do not appear. Secondly, even with families 
whose history is completed, those cases in which the insanity of parent and child 
is contemporaneous are far more likely to be recorded than those in which the 
child becomes insane long after the parent*, and thus the cases which show 
anticipation are more likely to appear in the record than those which tell against 
Dr Mott’s views. Thirdly, by directly comparing parent and child, he has practi- 
cally limited one of the two groups which are being compared to ages at onset of 
over 20 years and has thus obtained further spurious anticipation. 


Dr Mott also lays stress on the appearance of insanity in a more intense form 
in the younger generation. “I have proved,” he sayst+, “that there is a signal 
tendency in the insane offspring of insane parents for the insanity to occur at an 
earlier age and in a more intense form in a large proportion of cases, for the form 
of insanity is usually either congenital imbecility, insanity of adolescence, or the more 
severe form of dementia praecox, the primary dementia of adolescence, which is 
generally an incurable disease.” But we have already seen that Dr Mott’s method 
of collecting his data is such that an enormous preponderance of early cases 
of insanity in the younger generation is inevitable and of course such cases are 
largely incurable. Type of disease is very closely related to the age of onset and 


* Dr Mott states (Archives of Neurology, Vol. v1. p. 82) that ‘the main bulk of the cards (i.e. his 
records), however, refer to parents and offspring admitted to the asylums within the last fifteen years.” 
+ Archives of Neurology, Vol. v1. p. 82. 


D. Herron 367 


by selecting the latter we can alter the proportion of any particular type of 
insanity. Dr Mott has obtained his material in such a way that, in the younger 
generation, cases of insanity coming on late in life are much less likely to be 
recorded than those which appear in early life, and hence the early cases are in a 
majority, but the change in age of onset, and consequently of the type, is entirely 
spurious and arises solely from the way in which the material has been obtained. 

We can now deal with the reply Dr Mott has made to Professor Pearson’s 
criticisms. In his Annual Report for 1912 (p. 62), Dr Mott says: “ Professor 
Karl Pearson, writing to Nature, November 21, 1912, ‘On an apparent fallacy in 
the statistical treatment of “ Antedating” in the inheritance of pathological con- 
ditions,’ criticises on mathematical grounds the evidence of anticipation. I do not 
feel myself competent to reply to the opinion of such an eminent authority on 
mathematics applied to biometrics, but it does not militate against my conclusions, 
nor explain away the fact that a large proportion of the insane offspring of insane 
parents are affected with imbecility or adolescent insanity; for granting the 
assumption that there is no antedating at all, we might rightly expect the ages 
at onset of insane offspring of insane parents to be comparable with the ages at 
onset of all the admissions to the asylums during the same period*. This is by no 
means the case, for amongst the insane offspring there is a far greater proportion 
atfected early in life, as is shown in the following figures and curves” (they appear 
here as Fig. 4 and Table V). 


According to these figures the onset of insanity among the recorded insane 
offspring of insane parents is considerably earlier than among the general admis- 
sions to asylums, but it has already been shown that this is due to the fact that 
the data have been selected in such a way that the early cases in the younger 
generation are the most likely to appear. Further, if Dr Mott’s argument be a 
valid one, we might also expect the ages at onset of the insane parents of these 
insane offspring to be comparable with the ages at onset of all the admissions to 
asylums during the same period. This is by no means the case as is shown in 
Fig. 5 below (see also Tables I and V). We see here that the insanity of the 
parents comes on at a much later period than among the general admissions to 
asylums and that there is a far less proportion affected early in life. If Dr Mott’s 
method of argument be sound, he has not only to deal with an antedating of 
insanity among the offspring but also a post-dating of insanity among the parents. 
Both are of course spurious and arise from the peculiar selection of the data and 
from the fact that, owing to differential death-rates, the ages at onset of “ admis- 
sions” will never be the same as the ages at onset of the admitted—i.e. the asylum 
population—at any time. 


* ««We might rightly expect”’ these ages to be different, because ‘‘admissions”’ are not the same as 
the population in the country who have at one time or another been insane. The percentages of total 
cases of acute mania, of senile insanity, of congenital idiocy, and of melancholia, who reach the asylums, 
are not the same. The reader has to distinguish between the population of admissions, the population 
of admitted, and the insane population of the country. A sample of the latter may be reached from 
completed family histories, but not from records on admission or from records of an asylum population. 

47—2 


368 Recent Studies af the Inheritance Factor in Insanity 


45-7 45 
Ko All Admissions [__] Ns All Admissions [_ ] 
/ Insane Children Wy fe Dar 
yyy YY Insane Parents of “Wy 
354 Yy of Insane Parents 35 Insane Children Yd, 
30 30 
g $ 
> > 
i) 
D> Ss 
= 20- = 
5 § 
Q AY 


= 
an 


t 


VaR" 25- 35- 45- 55- 65- 75- 
Age at Onset of Insanity. Age at Onset of Insanity. 
Fig. 4. Fig. 5. 


Diagram to illustrate the Distribution of Age at Onset of Insanity among: 
(1) The Insane Offspring of Insane Parents. 
(2) The Insane Parents of Insane Offspring. 
(3) All Admissions to L.C.C. Asylums. ; 
TABLE V. Percentage Comparison of the Age at time of Onset of Insanity in 
the Insane Offspring of Insane Parents and the General Admissions to 
the London County Asylums. 


MALE FEMALE Toran 
Age at 
Onset of | 4489 direct | 274 insane | 5097 direct | 389 insane | 9579 direct | 663 insane 
| Insanity admissions | offspringof | admissions | offspring of | admissions | offspring of 
during last insane during last insane during last insane 
four years parents four years parents four years parents 
Under 25 20:0 43°8 20°2 44-2 20°1 44:0 
25—84 19°9 27°7 19°9 28-0 19°9 27°9 
35 --4h 21°9 13°8 21-5 16°7 21°7 15'5 
45—B4 Teel 7) 10°2 18°6 7°4 18:2 8°5 
55—64 1353 3°6 12°4 2°8 12°7 3:2 
65—7T4 SPH O-'7 59 0°8 5°8 O'7 
75 45) — 1°6 _- 1135) — 
41 male imbeciles out of 274 offspring 
54 female 389 ” 


”» 
95 male and female , alae 663 


D. Heron 369 


It is possible to illustrate the various fallacies which vitiate Dr Mott’s conclu- 
sions regarding anticipation by considering the age at death of parent and child. 
I do not know whether it is generally recognised that it is exceedingly difficult 
to get any considerable body of data in which the ages at death of a parent and all 
his children are given, for of course the record is incomplete and biassed until the 
death of the last surviving member, and in some cases to get a complete record we 
must trace the history of a family for over 150 years. George the IIIrd, for instance, 
was born in 1738 and all but one of his 15 children were still alive in 1810, 72 years 
afterwards, and the last surviving son, Duke of Cumberland and King of Hanover, 
did not die till 1851, 113 years after his father’s birth—and this is by no means 
an extreme case. In the material I am about to describe I found one case where 
the interval was 160 years. 


Another difficulty which arises is the tendency in practically all family 
histories to omit infant deaths, so that we do not get a complete record. It 
seems probable that the deaths of minors are not represented in such records 
in anything like their true proportion and that the differences are greater than 
might be expected to arise from differences of physique and nurture due to 
class. Thus records of the Landed Gentry give 31 deaths per 1000 males under 
20 years* while actual experience shows 163 to 197 per 1000+. But in the 
records of the reigning families of Europe we get a practically complete record of 
all members and therefore from von Behr’s Genealogie der in Europa regierenden 
Fiirstenhdusert{, I have extracted particulars of the age at death of over 2000 
individuals—all belonging to the 18th century. There was here no selection— 
every child was entered and every family had been traced from the birth of the 
parents till the death of the last survivor. 


Now in Dr Mott’s data we have already seen that cases in which the age at 
onset of insanity in parent and child is contemporaneous are most likely to be 
recorded. We can test the effect of a selection of this kind by investigating the 
effect of selecting, from our data regarding the age at death among those royal 
families, only those individuals who died within a certain number of years of their 
father’s death, and the results are given below in Table VI, p. 370. 


When we deal with the whole of the data, absolutely unselected, every family 
being complete and traced to the death of the last surviving member, we find that 
680 out of 1829 or 37:2°/, died under 20 years of age. Let us now apply a very 
slight selection to the data and reject the 92 cases in which the interval between 
the deaths of father and child was at least 60 years. We find now that 680 out of 
the remaining 1737 died under 20 years of age—or 39:1 °/,. Thus the effect 
of a selection of this kind is to cause a slight increase in the proportion of deaths 
at the early ages. If we make the selection slightly more stringent, by taking only 
those who died within 40 years of their father’s death, the percentage of individuals 
dying under 20 years of age rises to 46°7 and if we go still further and consider 


* See Pearson: Proc. R. S. Vol. 65, p. 291. + Statistics of Families, p. 73. 
Pp ? p 
+ Tauchnitz, Leipzig, 1870. 


370 = Recent Studies of the Inheritance Factor in Insanity 


TABLE VI. 


Illustrating the Effect of Selection of Material on the Distribution 
of Age at Death. 


(Reigning Houses in Europe—18th Century.) 


Children who died: 


All Cases ie 
Unselected within 60 years | within 40 years | within 20 years in their 
Age at Data of their of their of their father’s 
Death father’s death father’s death | father’s death lifetime 
Numbers} °/, |Numbers| °/, |Numbers] °/, |Numbers| °/, |Numbers| °/, 
Under 20 680 | 37°2 680 39°1 680 | 46°7 680 | 6274 648 82°7 
20—389 277 15:1 277 15°9 277 19-0 254 | 23°3 121 15°4 
40—59 336 18°4 336 | 19°3 274 | 18°8 127 Igoe 15 1:9 
60—79 450 | 24°6 395 | 22:7 | 214 14:7 29 2°7 — — 
80and over 86 4°7 49 2°8 | 10 7 — — = = 
Totals 1829 | — 1737 | — 1455 — 1090 - 784 = 
Average | 
Age at 35°9 33°7 26°9 16:2 eu 
Death* 


only those who died in their father’s lifetime, then the percentage rises to 82°7 °/,. 
Looking at the matter in another way we find that the average age at death has 
fallen from 35°9 years to 7°7 years. 


The same facts are given in Fig. 6, which shows that as the selection of cases 
becomes more stringent, there is a regular increase in the proportion of deaths at 
the younger ages. In exactly the same way, the fact that cases where the insanity 
of parent and child is contemporaneous are the most likely to appear in Dr Mott’s 
records causes a spurious exaggeration of the cases of insanity at early ages in the 
younger generation and consequently a spurious exaggeration of the number of 
cases of imbecility and adolescent insanity. 

We can also investigate directly the question of anticipation or antedating 
on this material. In order to avoid the heavy weighting of large families which 
would arise if every child were entered, I have taken only one child from each 
family. Let us consider first of all the distribution of age at death of Fathers 
and their First-born Children. The facts are given in Table VII. 


We have altogether 294 cases in which we know the age at death of a father 
and his first-born child. None of the fathers died before 20 but of the children 


* These averages were calculated, not from the five age groups given above, but from the same 
material classified in 15 age groups. 


D. Herron B yall 


106 out of 294 or 36:1°/, died before 20. The average age at death among the 
fathers is 61 years, but among the children it is only 36 years, so that there is an 
anticipation of 25 years. To borrow Dr Mott’s words, the figures clearly show 
the signal tendency among the offspring to die at a much earlier age than 
their parents; that is to say, anticipation or antedating is the rule. 


Age at Death. 
O= 205 40- 60- 80- 


A OO 
== 40 
as 


20 


Children who 


2 group 


died within 


40 years of their 


vn €AC 


Fathers’ death 


Percentages dying 
h age 


anu ee on 
i ee aS 
died within : * 60 
20 years of their = 40 ae aa 
2 ey 
Fathers’ death &.= 20 oy 


E 
TTT TT 
EET EEE 


S$ 80 a ee 
Fs 
er ge 60 
~o~ 
pba Ae SES 
Fathers’ lifetime $§§ a Soe 
LC a 
QS . — 


Age at Death. 


Fig. 6. Diagram to illustrate the Effect of Selection of Material upon the Distribution of 
Age at Death. (Reigning Houses in Europe, 18th Century.) 


Now in this material there is no selection of families. Every family was taken 
and the age at death of every first-born is known, so that we are only left with the 


372 Recent Studies of the Inheritance Factor in Insanity 


TABLE VII. 
Showing Anticipation in Age at Death. A. Fathers and Children. 
(Reigning Houses in Europe—18th Century.) 


é __ | First-born First Sons who 
Age at Death Fathers Children | Fathers Hadeckularen 
0— 9 — 95 s oo. 
10—19 — 11 os as 
20—29 6 21 4 8 
30—89 16 18 8 15 
40—49 45 31 31 39 
50—59 70 34 54 39 
60—69 tid 37 58 44 
70—79 62 33 46 54 
S80—89 18 14 12 13 
90 and over _ — — 1 
Totals 294 294 213 213 
Percentage dying under 20 0 36:1 0 0 
Average Age at Death ... 61 36 60 59 
| Anticipation de es 25 1 


third of Dr Mott’s fallacies, in that no allowance has been made for the fact that 
the parental group is limited to ages over 20 while more than a third of the off- 
spring die under 20. The effect of this selection can be removed almost entirely 
by taking instead of the first-born child, the first son who married and had at least 
one child. There are in all 213 such cases and we see that there is now no 
anticipation. The difference between the average ages at death is less than a year 
and by removing the artificial selection we have got rid of all anticipation or 
antedating. 


These facts are also shown graphically in Figs. 7 and 8. The horizontal scale 
gives the age at death in 10-year groups while the vertical scale gives the actual 
numbers of parents and offspring dying in each age group. The diagram on the 
left shows marked anticipation, and should be compared with Dr Mott’s diagram 
(Fig. 1) in which the ages at onset of insanity of father and child are compared. 
When, however, we get rid of the selection of cases by taking only sons who have 
had children, then there is no anticipation. 

If we compare the distributions of age at death in mothers and children we get 
exactly the same results. The facts are shown in Table VIII. 


We see that the first-born children died on an average 18 years before their 
mothers, but when we compare the age at death of mothers and the first son in 


REIGNING HOUSES IN EUROPE — I8™ CENTURY. 


U 
Bg 


Zz 7) 

2 '& 

aoe 

= brs 

6) 

> 

q 

ae 

(eo) 

| ol 

4 

(e] 

2) 

= 

"2) 

« 

L 

Le 

O° 

& 

a 

wW 

= 
ao) 

ee <0 
25) 

Le = 

fe) Pi 

Te a) 

Ee & 

WwW * 

a 3 

7s < 

a WwW 
ie) 

Ww Caece 

ee: 
re 
zt 


yn 
a 
WW 
. FS 
Chia: 
ce 
s) 
Zz 
a 
fo) 
a. 
led 
"2 
a 
Le 
we 
re) 
& ee 
© 
a 
e fo} 
Ae 
g 3 
[rs 
fe) & 
ar Wd 
5 g 
a ee 
a 
<u 
& a 
Ww <i 
Oo 
= 


Biometrika x 


D. Heron 


N : 

\ ANN YY) 

WY) \\ \\\ XN \« 

\ \ 
{o>} 
itp) 
z 
i} 
a 
fay 
2 
z= - 
re) z \ 
ae, 
EE 2 
Seo: 
25 
Zz fo) 
a < 
ea T a Sao ea on eee 
0 Q 0 ° 9 ° 0 ° 7) fe} 7) fo) 
0 to) t t io) N N 


AIN3NOD 3Y4 


CHILDREN Z 


a 
—— 


36 


FIRST BORN CHILDREN 


ANTICIPATION 


TET 


AIN3IND3YS 


DEATH 


AT 
Fig. 8. 


AGE 


DEATH 


AGE 


48 


Fig. 7. 


374 Recent Studies of the Inheritance Factor in Insanity 


TABLE VIII. 
Showing Anticipution in Age at Death. B. Mothers and Children. 


(Reigning Houses in Europe—18th Century.) 


First Sons to 


| First-born 


| 
| 
, . | 
| Age at Death Mothers | “Ghijdren’ | Mothers | pave Children 
0— 9 —- 122 — = 
10—19 2 13 hers — 
20—29 47 26 21 8 
380—89 | 49 22 30 16 
4O—49 | 43 32 A) 41 
| 00—o9. |aeom 35 39 | 40 
60—69 | 80 42 52 46 
70—79 leer O2. an anes 4] 54 
SO—89 Po eS ear Wa 10 14 
90 and over | Die eal oH 1 1 
| a | 
| Totals | BE) Sar 220 220 
Percentage dying under 20 6 39°1 5 0) 
Average Age at Death ... | 53 35 55 59 
Anticipation Ss aoe 18 —4 


each case to have children, then the sons live four years longer than their mothers. 
It would have been better in this case to have compared the mothers with the 
first daughters to have children but unfortunately von Behr gives very little 
information regarding the female lives, except in special cases. The figures show 
a marked anticipation in age at death when we directly compare, as Dr Mott 
has done, mother and child, but this vanishes when we remove the arbitrary 
selection. The same facts are shown graphically in Figs. 9 and 10. 


If we combine these figures we can compare the age at death of parent and 
child and the results are shown graphically in Figs. 11 and 12. 


Fig. 11 shows that Dr Mott’s limitation of one of the two generations he is 
comparing to adults, without imposing a similar limitation on the other generation, 
introduces an artificial and spurious anticipation. The average age at death of 
the parents is 56 years and of their first-born children only 35 years—so that we 
get an anticipation of 21 years. If, however, we make the two generations 
almost directly comparable by dealing only with sons who have children—there is 
no significant difference between the two averages (58 against 59 years). 


In these cases we have dealt only with completed families and have taken 
every family without selection. If, however, we consider only the cases in which 


CCA 
KK 


\\ 


OC Of Ome RO aol Ol (FO) ve Ox i 
wo no vt ¢t o S} a si 2 g 2 


ADIN3INOAYS 


i 
: ey 


tS a at 


| 


3 
A 
RAC 
a 

i. 


376 Recent Studies of the Inheritance Factor in Insanity 


. COW 
pst 

een eee 
oa «8 § 


z 
\) 

Mt CL) W 
re 

Parked 

fay ofS) 


Co 


EE, 


D. Herron 


the eldest child died in his father’s lifetime the amount of anticipation is greatly 


increased. The facts are shown in Table IX and in Fig. 13. 


REIGNING HOUSES IN EUROPE — I8™ CENTURY. 


ACE AT DEATH OF FATHERS & OF FIRST BORN CHILDREN. 


WHO DIED IN THEIR FATHERS’ LIFETIME, 


AVERACE ACE AT DEATH OF:- FATHERS 
FATHERS 62 
CHILDREN 10 CHILDREN 


ANTICIPATION §2 


100 


FREQUENCY 


We see here that among the fathers none died under 30 while 87°/, of their 
children died under 30; the average age at death among the fathers was 62-- 


among the children only 10, showing an anticipation of 52 years. 


378 Recent Studies of the Inheritance Factor in Insanity 


TABLE IX. 


Showing Anticipation in Age at Death. C. Fathers and First-born 
Children who died in. their Fathers’ Lifetime. 
(Reigning Houses of Europe: 18th Century.) 


First-born Children 


Age at Death | Fathers | dying in their 
| Fathers’ lifetime 
| 
0-9 | et | 89 
LO==19 | — 11 
20—29 — 16 
380—89 | 6 10 
4O—49 | 20, 5 
50—59 | 32 2 
60—69 3D — 
O79) 32 = 
80—89 10 | a= 
Totals 133 133 | 
| 
Percentage dying under 20 | O a) 
Average Age at Death ... 62 10 
Anticipation oe aae 52 


It is now possible to illustrate the effect of the principal fallacies which vitiate 
Dr Mott’s conclusions. In the first place he has dealt with families which are 
largely incomplete and has collected his material in such a way that cases in 


which the insanity of parent and child is contemporaneous are the most likely to 
be recorded ; in the second place he has directly compared parent and child with- 
out allowing for the fact that practically no parent can become insane before 20, 
while there is no limitation of this kind among the offspring of these insane 
parents. 


In Table IX and Fig. 18 we see the effect of dealing with incomplete families 
in which the children died in their fathers’ lifetime. There we get an anticipa- 
tion of 52 years. If we get rid of the first and second fallacies involving a 
selection of cases by dealing with every family, as shown in Table VII and Fig. 7, 
the anticipation falls to 26 years. If we get rid of the third source of fallacy 
also, by comparing the fathers with the first sons who have children, asin Table VII 
and Fig. 8, then the anticipation falls to less than a year. The Law of Antici- 
pation or Antedating has thus in Dr Mott’s case no foundation, in fact it is a 
spurious result of the mode of collecting and interpreting data. 

Now Dr Mott has not only asserted that this “ Law” applies to insanity but 
has also drawn the conclusion that the offspring of insane parents if still normal 


D, Heron 379 


at the age of 25 may safely marry. In an address delivered before the First 
International Eugenics Congress*, he said: “ You will observe that 47°83 °/, of the 
500 offspring had their first attack (of insanity) at or before the age of 25 years 
and as you see in the curves of parents and offspring, the lability of the child of 
an insane parent becoming insane tends rapidly to fall. Now besides the fact 
that this shows Nature’s method of eliminating unsound elements of a stock, 
it has another important bearing, for it shows that after twenty-five there is a 
greatly decreasing lability of the offspring of insane parents to become insane 
and therefore in the question of advising marriage of the offspring of an insane 
parent this is of great importance. Sir George Savage recently said that this 
statistical proof [sic !] of mine entirely accorded with his own experiences, and that 
if an individual who had such an hereditary history had passed twenty-five and 
never previously shown any signs (of insanity) he would probably be free and he 
would offer no objection to marriage.” 


Now I entirely fail to understand how anyone could recommend marriage in 
such cases, even on Dr Mott’s own figures; for if it be true that 48 °/, become 
insane before 25, it must be equally true that 52°/, become insane after that age 
and this very important point seems to have been forgotten. These figures, 
however, are taken from Dr Mott’s selected data, selected.in such a way that the 
early cases are enormously exaggerated. Until Dr Mott publishes a series of 
complete pedigrees, it will be safer to assume that the age at onset of insanity 
among the offspring of insane parents does not differ widely from that of all 
admissions to Asylums and there we find that only 21°/, become insane before 25, 
and 79°/, after 25. 


But surely at a Eugenics Congress of all places some thought might have been 
given to the mental condition of the children resulting from such matings, before 
advising marriage. It would not have been difficult for Dr Mott to have extracted 
all the available cases of this kind from his collection of pedigrees, i.e. all cases in 
which an individual had an insane parent and was normal at the age of 25, and so 
have discovered the probable fate of the offspring from such matings. 


Unfortunately the details given by Dr Mott regarding his pedigrees are usually 
so scanty that little use of them can be made, but two at least show the danger of 
the matings Sir George Savage and he sanction; these two pedigrees were given 
by Dr Mott in his lecture on Heredity in Relation to Insanity, delivered to the 
members of the London County Council. The first is shown in Fig. 14. (It 
appeared as Fig. 11, p. 18 of Dr Mott’s lecture.) In the first generation a man 
who became insane at 70 had four children. The eldest, a girl, became insane at 
68 and was therefore normal long after the age of 25. Dr Mott does not state 
whether the marriage of this woman preceded or followed the onset of insanity in 
her father, but even if her father had become insane before her marriage, Dr Mott 


* Problems in Eugenics, p. 425. This is one of many illustrations of the evil done by that Congress ; 
attention was directed and much weight given to hasty statements and ill-digested material. 


380 =Recent Studies of the Inheritance Factor in Insanity 


would have raised no objection to the marriage since the woman herself was not 
insane. There were in all six children from this marriage of which Dr Mott would 
have approved. Two became insane, three were blind and five are said to have 


been paupers. 


= Insane P= Pauper B= Blind 


Fig. 14. 


el 


Til 


IV 


ie 3 
®&) = Tuberculosis gy = Suicide 
& = Insanity Con = Congenital 


ZG) = Unknown 


D. HErRon 381 


The eldest child remained normal till the age of 34 and although both his 
parents became insane Dr Mott apparently would not have objected to his marrying. 
He did so and one of his children became insane and eight out of nine are said to 
be paupers. These nine children are apparently still young so that their ultimate 
fate is still uncertain. 


The second pedigree I shall quote was given as Fig. 28, p. 33 of Dr Mott’s 
lecture, and appears here as Fig. 15. 


A man who had an insane father and an insane grandfather became insane at 
the age of 55. He was therefore normal at the age of 25* and Dr Mott would 
have sanctioned marriage in his case. He actually married twice. His first wife 
was tuberculous but not insane; they had two children, both insane. His second 
wife was normal and it is definitely stated that there was no insanity in her 
family; they had five children and one of these became insane. Yet Dr Mott 


would permit the children of insane parents to marry if only they are normal 
at the age of 25! 


Again, Dr Mott has stated that it is useless to attempt to limit the fertility of 
the insane since most of their children are born before the onset of insanity, 
and therefore before any action can be taken. From his statistics of relatives in 
L.C.C. Asylums, Dr Mott has calculated the proportion of offspring who were born 
after the first attack of insanity in the parent and found that “Forty-six offspring 
out of 581 were born after the first attack of insanity in the parent, 1e., 7°9°/,. 
That is to say in the case of 529 insane parents, the birth of only one-twelfth of 
their 581 insane children would have been prevented by sterilisation or life segregation 
of the parent after the first attack of insanity. These figures refer to the offspring 
which become insane, but there are a large number of offspring which do not become 
insane and these would be cut off if life segregation or sterilisation were adopted +.” 


But here again Dr Mott is using the data obtained from his index of relatives 
which shows a greatly exaggerated number of cases at the earlier ages among the 
offspring, and he thus greatly exaggerates the number of cases in which the children 
were born before the onset of insanity. No conclusion can be drawn from any but 
complete records of families. But apart altogether from this, many of these parents 
are themselves the children of the insane and much could be done to discourage 
such marriages. Unfortunately as we have seen Dr Mott directly sanctions 
marriage to those who remain normal till the age of 25. 


In further support of his view Dr Mott has stated that out of 642 females 
admitted to three London County Asylums in 1911, 148 were recurrent cases and 
of these 32 (21°/,) had children between their respective dates of admission. 
“The inference that can be drawn,” he says, “is that about one-fifth of the 
recurrent cases, or approximately one-twentieth of the female admissions have 


* If the term ‘‘age at onset”? has any real meaning. 
+ The italics are Dr Mott’s. 


Biometrika x 49 


382 Recent Studies of the Inheritance Factor in Insanity 


children after their first attack of insanity and of 31 such cases examined, 73 chil- 
dren were born after the first attack of insanity in the parent.” 


But have these 148 recurrent cases been followed up to the end of the repro- 
ductive period? Not at all. No ages are given and the cases are merely those 
which were admitted to Asylums in 1911, Dr Mott’s remarks being made in 
June 1912, so that no attempt has been made to follow them up. There is no 
justification for Dr Mott’s advice. 


There are many other points in Dr Mott’s work which deserve detailed exami- 
nation, but time will not permit more than a brief account of a few of them. 


It should be noted, for instance, that Dr Mott has used his index of relatives 
in London County Asylums as an argument in favour of the importance of the 
inheritance factor in insanity. His argument is as follows: 


“At the present time in the London County Asylums there are 725 individuals 
so closely related as parents and offspring, brothers and sisters. A priori, this, to 
my mind, is striking proof of the importance of heredity in relation to insanity, 
for we cannot suppose that 20,000 of the 44 millions of people in London brought 
together from some random cause would show such a large number closely related 
eis i0 9) oc 

But Dr Mott has not attempted to give, and I doubt if he ever will be able to 
give, a satisfactory estimate of the number of relatives in even a random sample 
of the population, and the population of asylums is far from being a random 
sample of the general population—there is for instance an extraordinary divergence 
inage. Yet without definite information on this point it would be impossible to 
say whether insanity is inherited or not—that is if we had to depend solely on 
Dr Mott’s data, 


It should also be noted that in these cases Dr Mott has clubbed together 
every form of insanity, from congenital idiocy to senile dementia, except of course 
cases due to specific infections or trauma. I myself think that course is the only 
possible one. To anyone who has studied even a few pedigrees of mental defect, 
nothing is more striking than the extraordinary number of different forms of 
mental defect that may appear in the same family. 


Seven years ago, in a First Study of the Statistics of Insanity and of the 
Inheritance of the Insane Diathesis*, I was confronted with the same problem, 
and after a full consideration of all the available data and of the opinions of those 
medical men who were best qualified to express an opinion came to the conclusion 
that the only possible course was to group all forms of insanity together, with, 
of course, the exceptions I have already indicated. The whole question was dis- 
cussed very fully in my paper and it was there suggested that an even broader 
classification might be of service. This point of view met with some criticism at 
the time but nothing has occurred to alter it, and the study of the inheritance of 


* Galton Memoirs, No. II. (Dulau and Co.) 


D. Herron 383 


insanity in general or of an even broader degeneracy must always remain the 
first object of our studies. 


Any investigation of the inheritance of special types of insanity or degeneracy 
can only be carried out however on unselected material—on the records of com- 
plete families. The type of insanity is so closely related to the age of onset that 
any tendency to exaggerate the number of early cases, as in Dr Mott’s material, 
will entirely vitiate the conclusions drawn. Thus Dr Schuster’s conclusions as to 
the inheritance of special types of insanity based upon Dr Mott’s data* must also 
be rejected on the above grounds. 


Dr Mott’s index of relatives in London County Asylums is unfortunately of 
very little value in the study of inheritance in insanity. Progress can only come 
from the study of complete pedigrees in which every member of the family is 
entered, whether insane or normal, and the ages of the normal at the time the 
record was made are just as important as the age at onset of insanity in the insane 
members, for a statement that a young man of 20 has not been insane is of a 
very different degree of importance from the statement that a man of 70 has 
not been insane. 


In the papers I have cited the children of the insane if normal at 25 are 
advised to marry, and it is asserted that it is useless to attempt to discourage the 
reproduction of the insane since most of their children are born before the onset of 
insanity, and that we should rely on the Law of Anticipation to end or mend 
degenerate stocks. 


I have shown, I think, that the Law of Anticipation as applied to the insane 
has no foundation in the facts provided and that the advice given as to the marriage 
of the insane and of their normal offspring is fundamentally unsound and directly 
eacogenic. Much yet remains to be learnt regarding the inheritance of the insane 
diathesis, but no one who has studied the family histories of the insane can doubt 
that in ivheritance we have by far the most important element in the production 
of insanity, and in view of all the facts it is the obvious duty of the Kugenist to 
discourage, rather than to encourage, procreation by the insane and even by those 
of their offspring who appear to be normal. 


* Report on the Statistical Investigation of Relative Cards, 21st Annual Report of the London 
County Council Asylums Committee (1910), p. 95. 


49—2 


ON THE PROBABLE ERROR OF THE BI-SERIAL 
EXPRESSION FOR THE CORRELATION COEFFICIENT. 


By H. E. SOPER, M.A. Biometric Laboratory, University of London. 


In a recent paper* Professor Pearson shows that where one character is in 
multiple graded grouping and the other in alternative categories, greater or less 
than a given magnitude, the correlation coefficient admits of simple expression ; 
the assumptions being that the unmeasured character, B, has a normal distribution 
and that the measured character, A, has linear regression upon B. Under these 
conditions the data required are the numerical ratio of the alternative groups, 
the standard deviation of the measured character and the deviation from the 
general mean of this character of the mean of one of the groups. 


This expression is subject to greater fluctuations of value in samples of NV 
of the population than is the product moment form, especially where one of the 
groups is relatively small; and it is proposed to find formulae for the mean and 
second moment of the errors from this mean to a first approximation, that is to 
terms in 1/N. These will appear in terms of the correlation coefficient, r, of the 
original population (which will be supposed normally correlated) and the fractional 
frequency, f, in that population of the group possessing the Bree or positive 
intensity of the character put into two classes. 

Let y be the graded character and # the alternative character the intenser 
value of which is possessed by the fraction f of the population. Let %, y be the 
general means and o,, o, the standard deviations of « and y. Then it is shown 
in the paper that if 2’, 7 are the means of the group /, 

pelt ee (1), 
cy La 
on the assumption that the regression of y upon « is linear; and that if 2 be 
normally distributed this is equivalent to 
ona x Se ee HONORE AOR ON NEDSS SC > (2), 
Cy Z 


* Biometrika, Vol. vu. (1909), p. 96: ‘‘On a New Method of Determining Correlation between 
a Measured Character 4, and a Character B, of which only the Percentage of Cases wherein B 
exceeds (or falls short of) a given intensity is recorded for each grade of 4.” 


H. E. Soper 385 


where z is the ordinate of the normal curve cutting off the area 
f=4(1—a) or 4(1 42) 
as defined in Sheppard’s Tables of the Probability Integral. 
Now if Do, Pi; Pa, ete. 


be the moment coefficients of the whole population with respect to the character 


y defined by 
0S (Mats) Nett cess cat mener rience ersta es (3), 


[see bi-serial table below which is here to be looked upon as representing the 


general population] and 
Dy; Dia! Pz ete. 


are moment coefficients of the group n’(=/N) defined by 


(i 2 1S) (COOP I BL geese en erm gona races anesoo00 (4), 
we may write f= pp, fy’ =pi, J = Pr. Ty = V(po— pr) and 
mye oe, Ay at Ms 
aE scl AOE he) De 2 aR oe (5). 
V(p2— pr") 2 
—+ Grades of y in Bi-serial Table. 
[tea ES 1 ea ae, ae 
V ” ” ” 
< Ny Nye 3 
on | 
° 
2 ny Ns! Ns 
as} 
S 
5 | 
oO 
Ny Ny Ns 


n'/N=f 

In samples of WV the frequencies n,, n,, ns’ and consequently the moment 
coefficients p and p’ and the ordinate z are subject to fluctuation and the values 
of the correlation coefficient calculated from this formula will have a distribution 
of errors. Let 7 be the mean value in such samples, ér the deviation from this 
mean value in any sample when 6dpp, dp,, dp), ete. dz are the deviations in moments 


and ordinate. Then 
= 3 Pi + Spy’ — (po + 8p) (pi + 8p) 


7 +6) : oe geet te hess Cormeen (3): 
/ {po + Sp. — (p, + 6p,)"} x (2 + 62) SP 
To express 6z in terms of deviations’ of the moments we have 
Ls, 
Ufa a (Fee ae 8 bh 
V Qar 8) 


386 Probable Error of the Bi-serial Correlation Coefficient 


Hence to second order terms in powers and products of deviations 
1 1 2 
240s = ——— en eee 
N Qar 
=z{l—aéa—}(1—a’) (Sa)} 


at+éa 1 AD 
Sie eee 


a /Qar 
sa] m 
= —— Sa(@e ay 
0 J 2a 5 
Cy 


=2["(1-ag-40 -@) B+...J de 


=z |da—4a(da)}. 
It follows that . 


etb2e=2+a8f—5 (8f) 


,_ i ine 
=z+adp — oF (Ope )8 kei beeeeeee (10). 


At the same time that this value is put in (7) we may simplify the expression 
and subsequent algebra by supposing the graded character y to be measured 
from its universal mean value as origin and in terms of its standard deviation as 
unit of measurement in which case 

n=0; pr=of=1, and by () py 2) ieee ees (11), 
and since p, =f (7) becomes, 
mM zr + op,’ — fdp, — Spy Sp, 


Expanding to second order of deviations we find, 


P+ OF HT 64 Oy disaw- ccnp ease oven eeeeeeeee (13), 


where 6,, 6, are the first order and second order expressions 
1 ; ' 
o= a {dp,' — fdp, — zr dp, — ar 8p,'}, 
fa 


1 : 
n= : {har8p, dpy — - Spy Spo + = 5p, dpy — $6p' Sp, + 4 fOp, Sp. 
— Spy dp + $2r (Op,)? + S er (dp)? + (1 + 2a?) _ (5p. Seek (14). 
Taking mean values 
T= MEAN Og cooler eae eaten (GUS), 
mean 6, being zero since by (3), (4) 
mean dp, =S (mean dnsys”)/N =, 
mean 6p,’ = S (mean 6n,’¥;”)/N = 0. 


H. E. Soper 387 


Mean 6, is to be evaluated by the formulae 
mean 67,,5p) = (Putv — PuPo)/ N 
mean Op, dpy = (puso — Pu Pu )/N | 
mean 6p. 6p. =(p'utv — Pups )/N 
of piich the first two are well known* and the third may be proved thus: 
N8Spu = S (Sngye) = 8 (Eng ys”) + 8 (bn ys"), 
N Sp, =S8 (6ns' ys”), 
*, N?Spudpy =S8 {(Sns )? yst*} + S {Ong Sng’ ys yy} + S [Ons Sng Ys’ Ys}, 
where in the third sum s’ may or may not equal s. 
But mean (6n;)? =n; (1 — n/N), 
mean (6n; dry’) =— ns ns’ /N, 
mean (dn; dny”) = — ns'ng/N, 
the last whether s=s’ or not. Thus we findt 
mean dpudpr = (puso — Pu Py — pu" Pv) | N = (p'usrv— pupo)/N. 


Evaluating mean 6, es these formulae we find, 
7 Saye 73 tt ar (p2’ — P2Po ‘) SES = (py — Pi Po) 


A (pr — Pipo) — & (ps — py Po) + of (Ps — Pips) 


; ; as ; ee 7 
—(pr — Po Pr) + $27 (ps — pr’) + 3 2r (pa po!) +1 + 2a?) 5, (Pr — Po Y ena let); 


in which the undashed moments, being those of a normal curve with unit standard 
deviation, about its mean, have the values 
Pi=9, Pr=l, ps=0, pr=3, 
and the dashed moments beyond the first two, 
Sy i eae 
have values depending upon the nature of the frequency distribution of y and z. 
Assuming «, y normally distributed t 


? =|, depareat ay YT dandy, 


ae 1 : 
pi = i | tice TOA og diay 
a -2 Tv Se 


* See Biometrika, Vol. 11. (1903), p. 275: ‘‘On the Probable Errors of Frequency Constants.” 
K. Pearson. The second follows in exactly the same manner as the first, since the constancy of the 
total frequency dealt with is only involved, in deducing the relations (i) and (ii) of p. 274, of that 
paper. 

+ Dae =8 (n,/y.")/N. 

+ Since the moments appear in the term containing 1/N any errors in their calculation due to 
incorrect assumption of normality will not affect the present approximate formulae provided such errors 
are of the order 1/N. 


388 Probable Error of the Bi-serial Correlation Coefficient 


Putting y=rex +7 and integrating with respect to 7 for constant 2, 


ie l Ly? 
pi=| (ray +(1 ae r)} ——— @ 2% da 


V Qe 


ps = {(rx)3 3 (rv) (1 Re r2)} oe Pat ce ia 


Sat A eeett alta a) | ee ON ANET Oo ovctc caaiSookdde cane (19). 


When these values are put in (17), and terms collected, the mean value of the 
bi-serial correlation coefficient in samples of V is found to be 
Mi : eee 


a i ae fs cenaaee i: | Pees. (20), 


where f =1—f= WN. ‘ 


In the work of obtaining this approximation all powers and products of 
deviations above the second order have been neglected. The means of such 
terms in samples of NV involve second and higher powers of 1/N* and the present 
result is correct to the first approximation. 


Again squaring (13) and taking mean values and subtracting the square of 
(15) we find to the same approximation as before 


G,7 = mean (or) —imeanyo,” seen ene ene eee (21). 


The evaluation of mean 6, being carried out precisely mm the same way as 
mean 6,, the result is the second moment of deviations of the bi-serial corre- 
lation coefficient in samples of J, 


ofa (e- i s (1 = (1 +f)! ror| ieee (22). 


Writing the two results (20) and (22) 
r=T {1 + (be 4P 1) 


of =a Niyee Wir Erk ee (23), 


the values of 1, Yat and y, for values of $(1—a) [=the smaller of n/N, n/N] 
from ‘50 to ‘01 are to be found in table (24). 


* See Biometrika, Vol. 1x. (1913), pp. 97—99. 
+ xa for $(1+a) was tabled in Biometrika, Vol. 1x. p. 27, and the table is reproduced in Tables for 
Statisticians and Biometricians, p. 35, Cambridge University Press. 


H. E. Soper 389 


$(1-a) fa Xa" va 3 (1—a) fa Xa” a 
50 0354 | 1:5708 | 2:5000 20 — ‘0871 2:0414 | 2°8578 
“49 ‘0353 15711 | 2°5003 ‘19 — -1001 | 92-0898 | 2°8951 
“48 0350 | 1°5722 | 2°5011 ‘18 — 1146 | 2°1437 | 92-9364 
47 0346 | 15741 | 92-5024 “17 — ‘1308 | 2:2035 | 2°9825 
“46 0339 | 1:5766 | 2°5043 16 — 1490 | 2:2703 | 3-0341 
“4S 0331 1°5799 | 2°5068 15 — 1696 | 2:3453 | 3-0923 
“hh ‘0321 15839 | 25098 Ly = 1931 24303 | 3°1582 
“43 0309 | 1°5886 | 2°5134 13 — +2201 25272 | 3°2337 
“42 0295 | 1°5943 | 2°5177 ‘12 —~ -2513 | 2°6389 | 3°3208 
“41 0279 | 16007 | 275225 ‘11 — 2880 | 2°7687 | 34224 
“40 0260 | 1:6079 | 2:5279 10 — +3317 | 29221 | 3°5427 
39 0240 | 16161 | 2°5341 095 | — -3568 | 3-:0095 | 3°6115 
38 0217 | 1°6251 | 2°5409 090 | — +3844 | 3:1057 | 3°6873 
3 ‘0191 1°6351 | 2°5484 085 | — 4153 | 32119 | 3°7713 
36 0163 | 1°6461 2°5568 080 | — +4499 | 3°3299 | 3°8648 
85 0132 | 1°6582 | 25659 075 | — -4887 | 34620 | 3-9696 
‘Bh 0098 | 1°6714 | 25759 070 | — -5324 | 3°6110 | 4:0879 
33 0062 | 1°6858 | 2:5868 065 | — -5822 | 3°7806 | 4:2294 
32 ‘0021 1:7015 | 2°5986 060 | — -6403 | 3°9748 | 4:3777 
‘81 |—-0023 | 1°7186 | 2°6115 055 | — °7083 | 4:2002 | 4°5584 
30 |—-0070 | 1:7371 | 2°6256 050 | — *7897 | 4:4652 | 4°7723 
‘29 + |—-0122 | 1°7573 | 2°6409 045 | — °8868 | 4:°7829 | 5-0283 
emt —-Ol79.) || 17791 2°6575 040 | —1:0053 | 51715 | 5:3410 
27 | —-0241 1°8028 | 26755 035 | —1:1577 | - 56568 | 57362 
26 |—-0308 | 1:8286 | 2-6952 030 | —1:3556 | 62859 | 6-2485 
25 |—-0382 | 1°8567 | 2°7166 025 | —1°6308 | 7°1347 | 69481 
‘24 |—-0462 | 1:8874 | 2-7399 020 | —2:0272 | 83600 | 7:9572 
23 =|—-0550 | 1°9208 | 2°7654 015 | ~2°5263 | 10:3024 | 9-4975 
22 | — 0647 19574 | 2°7933 010 | —3:8889 | 13°9393 | 126086 
21 |—-0753 | 1:9974 | 2-8240 

| 
ee (24) 


The bi-serial value of the correlation coefficient has the standard deviation 
Op = (Ke — Wart ry/ VN, 
whilst that of the product moment value is 
op =(L—)/VN 
In table (25) a comparison of the values of the numerator is made for five 


values of r, for divisions at 0, ‘5, ...2°5 times the standard deviation from the mean 
of the ungraded character. 


Values of ,/(xa2— par? +74) for $(1-a)= 
Values 
? of | 
(a “500 “309 “159 067 023 -006 
‘00 1:00 E25 1°31 1°51 1:93 2°76 4°5 
225) “9375 1:19 1°25 1°45 1°86 2°68 4:3 
“50 ‘750 1:00 1:06 1°26 1°65 Dea, 4°0 
wy) "4375 “69 ‘75 94 1°30 1:95 3°2 
1:00 ‘00 OF oo “49 ort 1:13 1°8 
Seahie (25) 


Biometrika x wy 


390 Probable Error of the Bi-serial Correlation Coefficient 


Thus the effect of grouping and applying the bi-serial value of the correlation 
coefficient is to add 25°/, to the probable error in the most accordant case where 
r is zero and the division equal, whilst if r is as large as ‘5 and one group as small 
as 10°/, of the whole the probable error is nearly doubled. For higher values of 
r the errors of sampling, in the case of the product moment formula, grow smaller 
and ultimately vanish when 7=1,; but the bi-serial values are not invariable in 
samples drawn from a perfectly correlated population but possess a variability as 
high as *27/s/V in the most favourable case when the grouping is equal. 


If the standard deviation be calculated from the approximate formula, 


Graf [2 —7) IN wee (26), 


which may be written * = 
Or = (Na — 12) NO eee CAD 


the error of computation will not be great for values of f and r commonly met 
with as the following table compared with the last will show: 


Values of ya—-7? for $(l-a)= 
ip 

“500 309 "159 “067 ‘023 -006 

‘00 1°25 1°31 1°51 1:93 2°76 4:5 
"25 1-19 1:25 1°45 1°86 2°70 4:4 
“50 1:00 1:06 1°26 1°68 2°51 4°2 
‘75 69 wi) 95 1°36 2°20 3°9 
1:00 "25 31 51 93 1-76 3°5 

soo0n (28). 


The difference between the two expressions only reaches 5°/, when the 
smaller group is less than 7 °/, of the whole. 


It will not be necessary, excepting in small samples, to apply a correction 
to the bi-serial formula for r in virtue of the mean of samples differing from the 
population value. The correction is less than 1/Nth part of the value calculated 
unless one of the alternative classes is as small as 4°/, of the whole. 


I have to thank fellow members of the Staff for assistance in calculating the 
tables. 


* For a table of xa see Tables for Statisticians and Biometricians, p. 37. 


ON THE PARTIAL CORRELATION RATIO. 


PART I. THEORETICAL. 
By L. ISSERLIS, B.A. 


§ 1. The theory of non-linear regression in the case of two correlated variables 
is due to Prof. Karl Pearson*. He shows that regression ceases to be linear 
when the correlation ratio » differs sensibly from the correlation coefficient 7 and 
establishes criteria for parabolic, cubic and higher forms of regression. 


The present paper deals with the regression surface of three correlated variables 
x, y, 2, Where, though the regression of z on a, y cannot be adequately represented 
by an equation of type 


Ze 


= ETE ed (1), 


om ox oy 


the regression of z on w for a constant y and of z on y for a constant « is linear. 
A large proportion of the non-linear cases that occur in practice fall into this 
class. It will be remembered that Z,, in (1) denoting the mean of the array of z’s 
for a given w and y the coefficients y;, y;’ are partial regression coefficients and it 
will appear that just as it is necessary to introduce the correlation ratio 7 for an 
adequate description of non-linear regression of two variables, there must be 
introduced multiple or partial 7’s for the description of such regression in the 
case of more than two variables. 


We recall the definition and principal properties of .7,—the correlation ratio 
of y on 2, oq, being the square root mean weighted square standard deviations of 


the arrays of y: 
i S(Nzo"n,) “ SS {roy (y - In) 


(_—-7’)o,/°= Ory V Notes (2), 
Om,  S (nz Ya — Y)} 
Cee Umea 
7 ae Naeem yer an Tecrinas treats (3), 
and NGG ST) Cy Sua ne) ys wacnaweeteace seals (4). 


* Drapers’ Company Research Memoirs. Mathematical Contributions to the Theory of Evolution. 
XIV. ‘‘On the General Theory of Skew Correlation and Non-linear Regression.” 1905, 


50—2 


392 On the Partial Correlation Ratio 


' Here we are dealing with WV pairs of two characters A and B. ng of these have 
the character w of A. Y, is the mean of this #-array of b's. o,,, is the standard 
deviation of this array, om, is the weighted standard deviation of the means of the 


arrays and Y is the value of y given by the regression straight line, Le. 


This is the “best fitting” straight line (in the Gaussian sense) to the means 
of the arrays, and is the regression line when the regression is linear. 


§ 2. Consider now three correlated characters A, B,C. If N combinations 
of A, B, Care taken, we may denote by n, the number of these which have the 
character A =a and by nz, the number in which A has the value w and B has 
the value y. Let %, 7, Z be the mean values of the total population, and let Zz, 
be the mean of z for a given w and y. The frequency of A=a, B=y, C=z 


1S alte 


We define the correlation ratio of zg on « and y, which may be denoted by 
vyll;, or if no confusion is likely to arise by H,, by the equation 


esl] Seen une = 20) > (6). 


The triple sum in the definition can be written 
SSS {Nayz (2 — Z+ 2 — Zry)?} 
= SS {Ngy (Z — Zry)?} + SS {(Z — Zay)} X S [Maye (2 — Z)} + SSS {Nays (2 — 2)} 
= SS {nzy (Z — Zzy)?} — 2SS {gy (Z — Zey)?} + No?. 


SYS Oe Zo 
= eee De G2 GSE a oe oe (7) 


This is a generalisation of the property of ,, given by (3). 


Hence pilates 


Further, the “best fitting” plane tou the means Z,, is given by 


2-2 L—2 592) 
== Wh Sf eyg OP ctahnsnetes See eee 8 
Gy a On ue Cy (8), 
Te Pyan 
where fg ven vealid tn ctde debe ne (9), 
ary 
> __Vyz— Vaz xy 
2 aa (a (10). 
Tay 


Let ,,R, denote as usual the maximum correlation of z with any linear function 
of « and y, then 


rayliter = 32a + Ys Tzy PIA MOOD CAS AHOC CI OO ORDO GODOOAt.00 (a): 


y2 2 > > > 
_ Pye Fen = WV yz x28 ay 


Ley 


u ™ 


L. ISSERLIS 393 


Subtract (11) from (7) after replacing 7, and rz, by the appropriate sums, and 
we obtain 


Laifet aS Ayla | N oy 


= SS ay (Z — Zn)? -—S | = (2—2Z)(a@—@) vs} == Jew ge (2-Z)(y-y) ve} 
a y 


Big ee ce Opie RNa 2s. Cs ye 2 PT Day 
= Sis ay (2 — Za)” — Nay = (Zxy — 2) (@ — @) Y3 — Nay 7 (Zay — 2)(Y— 9) Ys | . 
a y 


Using (8) this can be written 
[eyll? — my R?| No? = SS {nay (Zry — 2) (Zay — Z)} 
= SS {Nyy (Zry — Z)?} + SS {xy (Z— 2) (Zn — Z)} ...(13). 
But SS {ney (Z— Z) (Zxy — Z)} 


=SS | (1 


9 yp / * ane 12 
= No, (Ys02" zx — Ys’ F2 — 33 Vay Fz + V3 T2V yz — V3V3 Vay Fz — V3 oz) 


“LB 1¥—Y 
Oz+ 42 J 
Ox Cy 


Co See wey 
Oz i eta z= Cray 3 is Fa) 
x y 


PMG aval ine — Va is May) ict 8 ys — Ya — Yalay))  <seessoersccseeeenies (14). 


Using the values of y; and y,’ given by (9) and (10) we see immediately that 


Tea — V3 — Ys Tay =Tyz— Ys — ¥3V xy = 0. 
Hence (13) becomes 
(cyl Ey reg RENIN op ISI SI OPI C29) 1 ee (15). 


This is the generalisation of (4). We deduce that ,,H,=,,R, if and only if the 
regression is strictly linear, that otherwise ,,H,>,,R7 and by (6) that ,,H?< 1. 


§ 3. These properties and definitions can be extended to the case of m 


variables #,, 2, ... fm. We now use » mt, for the mean of a, when a, a3, ... &m 
are given and denote by S a summation extending to the variables a, a, ... wp. 
Le. 497 


If we define the correlation ratio of x, on a, #3, ... 2m by the equation 


UN Git mace eda) iS IC gel ali) Maan easiest lesesed (16). 


1...m 


We can deduce in the same way as in § 2, the relation 


AM ote mev ds Wioc=3 112) (Gp coe tn i Oe (17). 
2... 
In order to generalise (15) we recall that the “best fitting” linear function of 
the variables a, 3, ... 2m to the mean o5 mi, 18 
Oe ye ie oeay meee ong ew me (18), 


on O71 G2 Oo; om 


394 On the Partial Correlation Ratio 


where by, = a and Ryq 1s the minor with its proper sign of the element in the pth 
row and qth elamn of the determinant 
= ecg rime 
Fayre ec eee cere Pee: 


i ry 


lm Vme OOo Ih 


while the maximum correlation between any linear function of a, 73, ... ®, and 
TIS ast le, Were 


R-R 
—o3,..miy? = pe = bisa Ost is os ie Olin Tun ate oe (20), 
ll 
S {ny (@, — DB) (2, — %)} S {ns (a — Z,) (#3 — X,)} 
= by ce a Dis 6 7 
Noo, Noo; 
8 {7m (@ — &) (2m — Lm)} 
+... + Dig —— Neto = aa ee (21). 
Subtract (21) from (17), noting that n.= S {rom}, etc., we obtain* 


3...m 


(, 3, eek FS iOn3 30. plate) No? 


= = oO a i 
= 8 {ae (%— oan | =f 2 |r = Bis (a, — X;) (a, a x) 
P 2 


2... 


oO 3 ae 
apiece Ar S {ran --. Dine (ay ae 7.) (Gan ro Zn) 
Im om 


as SENG oO — ae =e 
== {No mM (&, = away +8 {n, ee Dis (a. = Lz) ae a x)| Tee 
9 C2 


2...m 


m ne 


oO pee = = 
+8 \" a bin (Ca ee Lm) ( m& fad a) 


= = Np o = 2 = 
= 8 |» (2, oe 2...) + Nem 4 Dy (2, aaa 2) (2. mA ae ;) 7 gsi | 


2... 2 


2... 2 


Oj —t 
= 8 tm (Gone 2) jaan — a+ Drs = (2 — ae) Tee 


m 


Ge ee 
a Dim es (Ge zm) eae eee (22), 


using (18) this equation becomes 
(s, 3, Ravel motores ana) No? > (Hen Cena at 2) (2, me — X1)} 


2... 


= 7/8) gen (om = DG) + 8 {fea an CG a X,) Goan aan X,)} 
2... 


2...m 


* By an extension of the notation described at the beginning of this section S3., denotes a 
summation with regard to the variables 73,14, ... 2,3 11,2,...m iS the frequency of a particular com- 
bination of the characters w,, x2, ... x,, While ny, is the frequency of the combination 2, 2. 


L. IssERuis 395 


But S {ts on (X, a ZL) Cann =e X,)} 


2. IN 


( = ee. geen 
Ly — Ly Xz — Vs er Ly — Ve 
=S ie (- Dig ——— 0, — dy O—---) (2. my —Mt+ Dis O,+... (24), 
o on o 


2...m ( 2 


and -Noyon%is= S {2m (@, —%;) (o—- ) 


1... 


=S (eee (a2 = Hy) Cant aa X,) 
2... 


with similar values for 7,3, 7.3, ete. 
.. the right hand side of (24) 
= = by? yp — Dy? oP — 043012047123 « -- 


oe Gy Osis bs0 or To3 — bis Oe Bis Dr Cie vaee 


= GOS (- Tro — Oya — Bis 05 — Oya t'n4g — « — Gina) 
+ o;7b,s (= 113 — Dis — Diy Ig — «- ) 
Goh Gam - ~.nens seen nee eIAeER SIR CIMA CE Ie eRe Eero (25). 


Each line in (25) is identically zero from the definition of the b’s and the properties 
of the determinant in (19). 
S {Ns mM @ m% a aa) 


2 2 2...m 
Hence payrelae = 2 ealagi => —— = WN Fe aii RIM 8s oie sks (ouslekele| oxsiejsie’s (26), 
Orie 


so that the fundamental properties proved by Professor Pearson in connection 
with the correlation ratio 7, hold for the generalised H defined in this section. 
In particular equation (26) shows that a necessary and sufficient condition for 
linear regression in multiple correlation of m variables is that 


O35 Pilla == pyr means 


For in this case the mean value of any array of 2, will lie on the “best fitting ” 
m-dimensional plane. 


§ 4. The regression surface of z on xy being assumed of any particular type 
the constants in the equation may be determined (i) by the method of least 
squares, i.c. by making the sum of the squares of the deviations 2,,—@(xy) a 
minimum, z= ¢(w#,y) being the regression surface, or (11) by giving such values 
to the constants that the correlation between z and (a, y) shall be a maximum. 
When ¢ (a, y) is of the second degree the two methods lead to identical equations 
for the determination of the coefficients. 


The same equations are also obtained if the surface be “fitted” to the means 
by the method of moments. There is, however,a distinction to be observed. The 
equation z= (a, y) when the regression surface is of specified degree contains a 
definite number of constants and the first two methods will give exactly as many 
independent equations as there are constants to be determined. The method of 
moments will give as many equations as we please if sufficiently high moments 


396 On the Partial Correlation Ratio 


are used, including of course the equations given by the “least squares” method 
or maximum correlation method. Even without introducing high moments, when 
there are three variable characters new equations may be obtained by the method 
of moments, by combinations of characters which do not arise in the other methods. 
The method of moments is most convenient for our purpose, but we shall only 
employ those equations which can also be justified by (say) the method of least 
squares. 

For convenience let the origin be taken at the mean of the three characters so 
that2—y—2— 0) Jet atytze denote 

SSS {Mayet y'2"} _ Pasty (27). 


a ee 
Noiiay Ge Ox Oy Cz 


With this notation 7,, and ga, are identical; when z does not appear in the 
product, it is sufficient to write @,s 4 

The most reasonable next approximation to make when a linear function 
(a, y) does not adequately represent the statistics is (a, y)=a quadratic 
function of a, y. 


a 2 2 
Let ELI rg OEE a (28). 


om Ox Gy Gxoy Oz Oy’ 


Multiply (28) by mz, sum for all values of w, y, z and divide by V 
OS + Cry HOA SF 200. -2. sess dea near eeeeee (29). 


’ : ales x De Z 
Multiply (28) in turn by “ae times —, 7, ae ae x and sum as before, 
ap IO i) Oa 


and we obtain 


Toy = 0 + OT yb CQnry Ap CGasit Pf nyte s0+efecioes seine nee enna (30), 
Pyz SO Olay + COmp + Oxy Hii Qys seceese osese nee Gnee eee eter (31), 
Qayz = Uxy + Aqary + OY aye + CYary + CYasy tI Yays veeceeceveseees (32), 
Gare = A+ Agast+ gary + CYary + CQat Hf Qary? vrecerececrscecscneas (33), 
Pye = AA Adaye + bq ys + COays + CYnry +S Qys -cescersensercereoees (34). 


Actual numerical fitting shows that in many cases e and f are small compared 
with c*. This is the case when the regression of z on « for a constant y and of 
z on y for a constant « is linear. We shall therefore confine ourselves in the 
present preliminary paper to the case where we may write 


2 gi BY ee (35). 


Oz Ox Cy OyGy 


Here for constant # or y the regression of z on y or of z on @ is linear. 


* Cf. Census of Scotland, 1911, Vol. 111. p. xuvi1. where Mr G. Rae obtains by moments the 
regression of fertility on age of husband and wife. Let W=age of wife, H=age of husband, C=number 
of children in completed marriages. He finds 

C yy = 20°149493 — 0°555812W — 0173804 — 0:002846 W2 — 0:003494.H?2 + 0:012675 WH. 


See also the paper by E. M. Elderton in the current part of this Journal, pp. 291—295. 


L. IssERLIS 397 


Equations (29) to (32) become, when the regression surface is given by (35), 


ORICA CTir lia oe ME sia a i sremiebe cin ccs ara ie aheteers sie yee scsgiee (36), 
Me MAO OI <IaICQ Bayh SAAR acide cing Sammarn imum netics tee’ (37), 
fig, OS OMe Cis, a apeons chochaed ciaden Mace se aeeog sare (38), 
Ynys = ON ny + BGary + OQmya + CQ ary? — szerereveecsteseerens (39), 


= — OF xy + Gary + bgaye + Care by (36). 


Solving these equations we obtain 


a oe b ee 
Wey ae. day Paz | ie | ary 1 Voz 
1 Yay? Vyz Yay? Tay Tye | 
aye § Jary2— Pry — Jaye | | Gury? — Tay Ya2y — Yacye | 
Cc . 1 
MMe te ol. a Gy ceases (aw): 
Pry 1 Vyz Poy 1 Tay? 
Gary = Yay? aye Gary Yay? Yay? — Tray 


we have already denoted the partial regression coefficients by y; and y;' so that 


_ Paz Vy2V ay + Tye Vaz ay 
(i acm ae and y; eure rie (9) and (10). 
In addition let VEO al) A ae (41), 
1l—Pryy 
and BeNLaE ies DOI Df ter oh ik gihee ct es (42). 
Ll Pry, 
After some reductions the determinants in (40) yield 
CYNIC Umm eee Nett ee Aah MN otto ena te 1 hil ve (43), 
b = Ys + ch sLbiaisiisteteraip/siiaretehareta'atalstanets seve) a seksi) eaactve rege cera ia tis yay fins (44), 
Or z2 ar br yz F Yxyz 
oS : Far ASE ae ee ee 45), 
Bary + Pay + Qary — Tay ( 
also NC pa Sete Cte ee Me aa cag Giasiis Gusta ot (36). 
Note that Vs Gary shits Qe N= NO ag AID Tay) eeosscswsodiscest sree (46), 
and Ys Vza a Ys. Vey = pln: (cf. eqn. 11 Js 
oy: S LY =F Y i 
§5. By definition (1—-yzH?)= ue ee : Zmy)?} ; 
oy 


nin using (35) 
ql H, ) =5 ea (= === ee = ug. CLY 
ayttz 


N \o, Cia aye SCRAORY 
=1-2812 (d+ 44 art.) Pash 
CO; Oy Cy Oxy N 


++ 04+ 04 Cqgaye + 2dergy, + 2abray + 2acqry + 2beqzy, 


Biometrika x 51 


398 On the Partial Correlation Ratio 


= gy? = — Arey, — Wry, — WAayz+ C+ B+ + Cqary2' 
+ 2cdryy + 2abrey + 2acqary + WCqayr-..-+-+- (47), 
= W(— Pag t A+ Orgy + CQa2y) 
+b (= 1yz +0 + zy + Cay?) 
+ (= Gayz + Ar zy + gary + bqays + C422) 


+d (d+ cryy) 
— ON 24 — Diyz = COpye «2-2 ee ee a ..(48). 
The first four terms vanish by eqns. (36)...(39), 
"sagy ll? = Oz +: Ola) HCOnye ees ee ee (49). 


If we now insert the values of a, b, ¢ from (43)...(45) in (49) 
ay, = (3+ CO) Pon + (ys + ch) r zy + CY xyz 
= Ys" en + Ys Tye + 6 (Ore + Oye + Gayz) 
(Or ze + PV yz + Vaye) 
ary? + Qayeh + Garye — Mary’ 
(Gaye (1 = Pry) = Gay (ToeTay — Tyz) — Yaty (TyeTay — Tae)\ (30). 


2 ey? + Qa — 2x2 27 % 
| = Pig) i ay? + q a == y Vary | (1 — r2,y) 
ay 


== aylees ete 


or yl a 


It follows from (50) and (15) that 


2 a2 % = 2 2 Ta. 
a (51) 


Yox2y2 — Vary > 


If we eliminate q,,z (which is a triple moment troublesome to caleuiae) 
between equations (45) and (49) we have ! 
ay? = OP zy + Dr yz + CGayz 
= OP ag + Dr yz + C (Gary — May) + ACQary + beqaye 
= Tox (3+ CO) + Tey (ys + Ch) + C (Gay2 — Tay) 
+ (Y36 + 09) Yary + (Ys'¢ + CD) Qanye 
=O [Gaye — Tay + Oqary + Gaye] + ¢ [Ore + Pry + Ys Gary + 9 dey] 
+ Ys" ex + Ys Vey 7 
= yl? + C [Gare — Pry + Oqary + $Yxy2] by (46) and (11), 
— Pay t Fay — 2qery? Qty =. 


ay Ht? — ayl? at | dev — Ty 


tea 
He RZ : 
Hence C= $$ et ee (52). 
2 Paty + Paye — 24ary2QaruT ay 
Gary? — Pay — = Seeage 
me ay 


This value of c? is positive by (51). 
Equation (52) shows that ,,H7=,,R? is a necessary condition for linear 
regression, which we have already proved in equation (26). 


L. IsskEr.is 399 


The regression surface of z on a, y is, with the values we have now obtained 
for the constants 


BZ Vg — Tyel ny © © Vyz—Vax2V ay Y 


as = = te 

om 1l- Prey Ox Us rey ae 
ar we el 2 ale: : {= ay? = Yary & e + Vay Yary — Cay Y — y 

Ee ee EY et ee a = pee 

4 7 Payt + Paty — 2Qa2y Yay?? ay pales ae ae 7 

sys 1 ay Ye 
1 a Tey ( 
(w7—%)(y—-Y) 
+ — pS “ey piavetanallslasons (53) 
Tiley 


The terms in the first line give the ordinary regression plane. In most cases 
the regression does not differ widely from linearity so that ,,H?—,R? is small. 


§ 6. We must now get some idea of the relative magnitudes of qa, Jay, Ge2 
and moments of lower order. 


First with regard to g.., which is equal to Leis 
: On Oy 
aye S (Nyt? y) = S (Nz ix") (54) 
Vary No? oy No? Oy Si aheieje<e/e\ere:e. 6 cise: e-0c8 Blexeie reece . 
If the regression of y on w be linear 
(1 7y ory] 
Ox : = 
Vary = S = Vyy VB, 


No,20, 
so that q,2, is zero if the regression is linear and the frequency of a symmetrical. 
In fact 22 — 1m VB, = 0 is the same as Pearson’s criterion for linear regression given 


by €=0 (Skew Correlation and Non-linear Regression, p. 30, Eqn. (1xix)). 


We may obtain a good approximation for qx, by considering the regression of 
y on « to be parabolic. This is a natural assumption to make if the regression 
surfaces of w on z, y and y on z, be also of the hyperboloid type we are discussing 


for the regression of z on a, ¥. 


For with origin at mean we may write 


@, Zia, 
Bip eg IS de” 


Ox Cy Ox Oyoz 
Hence keeping y constant and summing for the 2’s 


Bh Ss, WEI hyey 


(on Oy Oe Gy Ozs 
Zz y V yne — 7 ye (Y == i 
But rye Pye" Yee Nymd Pye 2 _ y: ie a V By’ Shee) | x 
Ge © By — By —1 (ey Cy | 


* See Pearson: l.c. Eqn. (lxv) (where Y,, is a misprint for X,), and py”, 8; refer to the distribution 


of z. 


Px 


400 On the Partial Correlation Ratio 


“" is a quadratic expression in y if we remember that h being of order 
Ox 


Ci SS Wan ane oa peas 
NE ay is of the same order as Vyn2—7yz*. 
But the relation $2 (x? — 12y) — € =9 is satisfied when the regression of y on 


# is parabolic t. 


Here d,=8.—Pi—1, €=€— Tay VB, €= qayt. 
aes Qaty — Ty VB, = Vang — Try V Bo — By — 1 veteeee eee eee eens (55). 
Similarly Quy — Tay V BY = Vn =a V By By = 1... were (56). 


The use of these approximations will save the direct calculation of gq, and 
zy provided we can determine the signs to be attached to Vz,?—7'sy and 


Vine — 1x. This is often easily done by inspection of the regression curve whose 


a y: we — Tay | ae — & 
= a5 —* a- -———1;8 
oy : ox B.— By —1 lox" VB, ox 5 
We can approximate to qax,2 as follows||: 


SS (tay ay?) _ S {nea (o*y, + (Va — yh 


da? = = : 
avy Nox2o,2 NoZo,7 ) 


equation is 


where Y; is the value given by the regression straight line and oy, x nz the 
second moment of the array of y’s for a given x, about the point Vz. 


S {n, 2 (Y2—Y)? ; 
But . - i 2 _ 8 (matrtve, Vo 
a Oy 
= Saag . 7 
S (Ny, 2? a7 yz) 
Thus ee ae mae 
Gary? Nox? oye +P Be vy > 
Bhd S(n yo? ) 
Similar] gf a Zs 
: Me Yaty Nozo/ Be Pay; 


or, so far without approximation 


ai 9 _<¢ Y 5 ¢ 
us = S(nz,a?o7y,.) + S (ny yxy) 
i oe cy) 
Ae 2Naeo,7 


ap $ (By ote B,’) Preys 


* It is noteworthy that the hypothesis that regression of z on a, y although of 2nd degree is such 
that regression of z on x for a constant y is linear leads to the result that the total regression of z on x 
is parabolic. 

+ Pearson, l.c. p. 28, Eqn. (lxiii). 

{ Pearson, l.c. Eqns. (li), (xlv) and (xiii). 

§ Pearson, l.c. Eqn. (lxv). 

|| It can be found fairly directly by tabling to the squares of the variates, when we need a simple 
product moment. In a later part of this paper some comparisons of actual and approximate values for 
numerical cases will be found. 


L. Isseruis 401 


Now the mean values of oy, and o?y, are known to be 
Cy liter) -and,-o,° (L—73,,,), 


and the deviations from these are usually somewhat irregular. Rarely can we 
do anything better than assume them to vary with a slight linear variation from 
the mean. For example 

Oy, = Oy (1 —7°gy) (1 + Ax), 


where A is small. In such a case 
Were (1 — rg) (144.8) 


or to a fair degree of approximation*, we may put 
: Vx2y2 = 1 Tey ar 4 (B, ar (SEO) Pevy, 

and thus write 

Quy — May = 1+ %(B2+ Bs — 4) Pay 

= 1+ y+ $(8,— 3+ Bo — 8) roy. 
The latter part of this expression vanishes if the frequency of the # and y variates 
be mesokurtic. It can of course be retained if desired but its product with 
V wy? — ey RZ will usually be of the second order. If we write 
v=t(Bo- 3+ B, — 3), 


we find the approximate regression surface 


Lim aa PyzVay (@ — @) ns Pyz — VazV ay We Y) 
Oz 1- Tey Ox t= Tey Oy 
/ se — ah? (w— 2) (y-Yy) PG (57) 
Tee Was | os = ae : 
This equation (57) enables us to express approximately the multiple ,,H, in 
terms of the simple 7, 292, xMys yNx- 


+ 


§7. To obtain this connection between the multiple ,,H, and the simple 7’s 
we may proceed as follows : 


OF 


Zay =d f= ae 


, origin at mean, 
oz ox Gg y Oy 


Hence keeping y constant and summing for the «’s 


“Ud 4 a — +2 uf ce ae ee ee (58). 
z & y FnOy 
» _ SSS {ney (2) — 2) les SSS (ayz 27) «¢ 
ue Luar aon No? No? ie 
SSS {Nayz (Zey — 2)? — SSS {Nay2? 
pee Yy. y fend aye” oi 
ead ay He = No? BeUNag -~ 


* T.e. we are neglecting terms of the second order as AV Bi. 


402 On the Partial Correlation Ratio 


page ls — yn? = SSs ka (“ (wy) | cy (a — =») (2d+a @ tty | 4 by 
ae ox oe oy oy Cy 

| Gyan 

= 5 (22) (a+ 2) [2 (a4 on) (24%) 

= ss |" (FS Ox #) (a+) 2 d+) + (a2) Ox ) . 

by By 0, — 2,2 CY\? Nay) 

99 a ie ay y Cy Nay 

s {(a+ ale jo Ses Ox ta + as |e On? (a+) N} 


by CY\" | Nay LU — X,? 
=0+8 \(a+ 2) (s Neorg )t. 


Now S {Naya} = 8 {(w — dy + Ly)? Noy} = ny (on + 0+ 2,7), 
x we 


a : CY \7 Ox,? N, 
ay H? — yn =8 (a+ “f) at : 


On 


ns fo+ 70-9 


y 


= ee + 2et 0), 


Cy oO, 
ay? — ne = (1 = yn2) (EO)... eee (59). 
Similarly eye ee = (8 —27,))(0F aC) nee eee (60). 


Remembering that a= y;+c0, b=+¥;' +ch we get from (59) and (60) 


a ra) be ay? — Ne 
woe See ee : SF le ¥5%3 (sh — ys 9) 


1- Qa 
; tiGe {ys o— 39 — Ob (ya = Ys, 0)} AERC (61). 
From the values of ys, y;, 9, @ in (9), (10), (41), (42) we obtain easily 


yzYary — Vaz Vay? 
/ yz 4ary az Yay 
Yah — ys = "een Teeter 
Seay 
/ Vaz Q x2 oan Tye Jay? 
y3 6-730 = 
1 — 1 xy 


Ya  — 29 — Of (ys — ys 9) 
— P